INTERNATIONAL SEARCH REPORT 



Inform Jl Application No 

PCT/GB 98/02048 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 6 C12Q1/68 



According to international Patent Classification (IPC? or to both national cjassiflcatton and IPC 
B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 6 C12Q 



Documentation searched other than minimumdocumentation to the extent that such documents are included in the fields searched 



Electronic data base consutted during the international search (name of data base and, where practical, search terms used) 



C- DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 



Citation of document, with indication* where appropriate, of the relevant passages 



Relevant to claim No. 



WO 95 04160 A (ISIS INNOVATION ^SOUTHERN 
EDWIN (6B); CUMMINS WILLIAM JONATHAN (6B) 
9 February 1995 
see claims and. figures 

US 5 302 509 A (CHEESEMAN PETER C) 

12 April 1994 

see the whole document 

WO 96 32504 A (UN IV BOSTON) 
17 October 1996 

WO 96 33205 A (SPECTRAGEN INC) 

24 October 1996 

see the whole document 

-/-- 



1-26 



1-26 



1-26 



1-26 



LH 



Further documents are listed in the continuation of box C. 



Patent family members are listed in annex. 



* Special categories of cited documents : 

"A" document defining the general state of the art which is not 
considered to be of particular relevance 

"E" earlier document but published on or after the international 
filing date 

"V document which may throw doubts on priority daim(s) or 
which is cited to establish the pubtteattondate of another 
citation or other special reason (as specified) 

"O" document referring to an oral disclosure* use, exhibition or 
other means 

"P" document published prior to the international fifing date but 
later than the priority data claimed 



later document published after the international fifing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

"X" document of particular relevance; the claimed Invention 
cannot bo considered novel or cannot be constdared to 
involve an inventive step when the document is taken atone 

"Y* document of particular relevance; the claimed invention 

cannot be considered to invotve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
m the art. 

"JT document member of the same patent family 



Data of the actual completion of trteintemattonai search 



22 October 1998 



Oate of mailing of the international search report 



29/10/1998 



Name and malting address of the ISA 

European Patent Office, P.S. £618 Patenttaan 2 
NL-2280HV Rljswiik 
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl. 
Fax: (+31-70) 340-3016 



Authorized officer 



Mutler, F 



Form PCT/ISA/210 (second sh*wr) (Jury 1992) 



1 



INTERNATIONAL SEARCH REPORT 

ti««ormatton on patent family members 



Interna t Application No 

PCT/GB 98/02048 



Patent document 




Publication 


Patent family 




Publication 




ctteQ in searcn report 






members) 




data 




WO 9504160 


A 


09-02-1995 


AT 


159767 


T 


15-11-1997 










AU 


695349 


B 


13-08-1998 










AU 


7269194 


A 


28-02-1995 










CA 


2168010 


A 


09-02-1995 

V J Via* A ^<7tJ 










CN 


1131440 


A 


JL*J U JU 










DE 


69406544 


D 












DE 


69406544 


T 
i 












DK 


711362 


T 


22-12-1997 










EP 


0711362 


A 


15-05-1996 










EP 


0778280 


A 


11-06-1997 










ES 


2108479 


T 


16-12-1997 










FI 


Q60403 


a 


pQ-.ni— 1QQ£\ 










HU 


73802 


A 












JP 


9501830 


T 


25-02-1997 










NO 


960370 


A 


28-03-1996 










US 


5770367 


A 
rt 








A 
n 




NONE 










WO 9632504 


A 


17-10-1996 


AU 


5544696 


A 


30-10-1996 










CA 


2218188 


A 
M 












EP 


0830460 


A 
n 






WO 9633205 


A 


24-10-1996 


US 


5750341 


A 


12-05-1998 










AU 


5549096 


A 


07-11-1996 










CA 


2218017 


A 


24-10-1996 










EP 


0871646 


A 


21-10-1998 




WO 9727331 


A 


31-07-1997 


AU 


2247397 


A 


20-08-1997 










EP 


0868535 


A 


07-10-1998 





^omi PCT/ISA/21Q (pat*** lamiy anrwx) (Jtfy 1992) 




Copy for the Elected Office (EO/US) 

PATENT COOPERATION ^ATY 



PCT/GB98/02048 



From the INTERNATIONAL BUREAU 



PCT 

NOTIFICATION OF THE RECORDING 
OF A CHANGE 

(PCT Rule 92bis.1 and 
Administrative Instructions, Section 422) 


To: 

DANIELS, Jeffrey, Nicholas 
Page White & Farrer 
54 Doughty Street 
London WC1N 2LS 
ROYAUME-UNI 


Date of mailing (day/month/year) 

21 April 1999 (21.04.99) 


Applicant's or agent's file reference 

88208/JND 


IMPORTANT NOTIFICATION 


international application No. 

PCT/GB98/02048 


International filing date (day/month/year) 

13 July 1998(13.07.98) 



1. 


Th 
X 


e following indications appeared on record concerning: 
the applicant | | the inventor | | the agent | j the common representative 


Name and Address 

BRAX GENOMICS LIMITED 
13 Station Road 
Cambridge CB1 2JB 
United Kingdom 


State of Nationality 

GB 


State of Residence 

GB 


Telephone No. 


Facsimile No. 


Teleprinter No. 


2. The International Bureau hereby notifies the applicant that the following change has been recorded concerning: 
| | the person | X| the name | | the address | | the nationality | | the residence 


Name and Address 

BRAX GROUP LIMITED 
13 Station Road 
Cambridge CB1 2JB 
United Kingdom 


State of Nationality 
GB 


State of Residence 
GB 


Telephone No. 


Facsimile No. 


Teleprinter No. 


3. Further observations, if necessary: 


4. 


A< 

"x 
~x 


3opy of this notification has been sent to: 

the receiving Office 

the International Searching Authority 

the International Preliminary Examining Authority 


X 


the designated Offices concerned 
the elected Offices concerned 
other: 





Authorized officer 




The International Bureau of WIPO 






34, chemin des Colombettes 


Yolaine CUSSAC 




1211 Geneva 20, Switzerland 






Facsimile No.: (41-22) 740,14.35 


Telephone No.: (41-22) 338.83.38 




Form PCT/IB/306 (March 1994) 




002578915 



m 



ATENT COOPERATION TRUTTY 



PCT/GB 98/02048 



From the INTERNATIONAL BUREAU 



rC 1 

MHTI pi r* AXIOM f\}P CI PPTIHM 

(PCT Rule 61.2) 


To: 

LJillieu Old It? o raiciU dilU 1 laUc?iriarK 

Office 
(Box PCT) 
Crystal Plaza 2 
Washington, DC 20231 
ETATS-UNIS D'AMERIQUE 

in its capacity as elected Office 


Date of mailing (day/month/year) 
14 April 1999 (14.04.99) 




International application No. 
PCT/GB98/02048 


Applicant's or agent's file reference 

88208/JND 


International filing date (day/month/year) 

13 July 1998(13.07.98) 


Priority date (day/month/year) 

11 July 1997 (11.07.97) 


Applicant 

SCHMIDT, Gunter etal 



1. The designated Office is hereby notified of its election made: 

in the demand filed with the International Preliminary Examining Authority on; 

05 February 1999 (05.02.99) 



□ in a notice effecting later election filed with the International Bureau on: 



The election 



00 
□ 



was 



was not 



made before the expiration of 1 9 months from the priority date or, where Rule 32 applies, within the time limit under 
Rule 32.2(b). 



The International Bureau of WIPO 
34, chemin des Colombettes 
1211 Geneva 20, Switzerland 



Facsimile No.: (41-22) 740.14.35 



Authorized officer 

C. Carrie 

Telephone No.: (41-22) 338.83.38 



Form PCT/IB/331 (July 1992) 



2566334 



PATENT COOPERATION TREATY 

PCT 



INTERNATIONAL SEARCH REPORT 

(PCT Article 18 and Rules 43 and 44) 



Applicant's or agent's file reference 

88208/JND 


FOR FURTHER see Notification of Transmittal of International Search Report 

(Form PCT/ISA/220) as well as, where applicable, item 5 below. 

ACTION 


international application No. 

PCT/GB 98/02048 


International filing date (day/month/year) 

13/07/1998 


(Earliest) Priority Date (day/month/year) 

11/07/1997 


Applicant 

BRAX GENOMICS LIMITED et al . 



This International Search Report has been prepared by this International Searching Authority and is transmitted to the applicant 
according to Article 18. A copy is being transmitted to the International Bureau. 



This International Search Report consists of a total of 



sheets. 



X | It is aiso accompanied by a copy of each prior art document cited in this report. 



1 . | | Certain claims were found unsearchable(see Box I). 

2. | | Unity of invention is lacking(see Box II), 



3. | | The international application contains disclosure of a nucleotide and/or amino acid sequence listing and the 

international search was carried out on the basis of the sequence listing 

|~| filed with the international application. 

| | furnished by the applicant separately from the international application, 

| [ but not accompanied by a statement to the effect that it did not include 
matter going beyond the disclosure in the international application as filed. 

|~| Transcribed by this Authority 



4. With regard to the title, [j£ 

□ 



the text is approved as submitted by the applicant 

the text has been established by this Authority to read as follows: 



5. With regard to the abstract, 



□ 



the text is approved as submitted by the applicant 

the text has been established, according to Rule 38.2(b), by this Authority as it appears in 
Box ll L The applicant may, within one month fromthe date of mailing of this International 
Search Report, submit comments to this Authority. 



6. 



The figure of the drawings to be published with the abstract is: 

Figure No. 4 CZl as suggested by the applicant. 

j£] because the applicant failed to suggest a figure, 
| because this figure better characterizes the invention. 



None of the figures. 



Form PCT/ISA/210 (first sheet) (July 1992) 



INTERNATIONAL SEARCH REPORT 



International Application No 

PCT/GB 98/02048 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 6 C12Q1/68 



According to International Patent Classification (I PC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 6 C12Q 



Documentation searched other than minimumdocumentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 0 Citation of document, with indication, where appropriate, of the relevant passages 




Relevant to claim No. 



A 



WO 95 04160 A (ISIS INNOVATION ; SOUTHERN 
EDWIN (GB); CUMMINS WILLIAM JONATHAN (GB) 
9 February 1995 
see claims and figures 

US 5 302 509 A (CHEESEMAN PETER C) 

12 April 1994 

see the whole document 

W0 96 32504 A (UNIV BOSTON) 
17 October 1996 

W0 96 33205 A (SPECTRAGEN INC) 

24 October 1996 

see the whole document 

-/-- 



1-26 



1-26 



1-26 



1-26 



LH 



Further documents are listed in the continuation of box C. 



Patent family members are listed in annex. 



Special categories of cited documents 



"A" 
"E" 



"O" 
"P" 



document defining the general state of the art which is not 
considered to be of particular relevance 

earlier document but published on or after the international 
filing date 

document which may throw doubts on priority cfaim(s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

document referring to an oral disclosure, use, exhibition or 
other means 

document published prior to the international filing date but 
later than the priority date claimed 



"T" later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

"X" document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

"Y" document of particular relevance; the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

"&" document member of the same patent family 



Date of the actual completion of theinternational search 



22 October 1998 



Date of mailing of the international search report 

29/10/1998 



Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Patentlaan 2 
NL - 2280 HV RijSWijk 
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl, 
Fax: (+31-70) 340-3016 



Authorized officer 



Muller, F 



Form PCT/ISA/21 0 (second sheet) (July 1 992) 



page 1 of 2 



INTERNATIONAL SEARCH REPORT 



International Application No 

PCT/GB 98/02048 



C^Continuation^POCUMENTS CONSIDERED TO BE RELEVANT 



Category ° 




Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



WO 97 27331 A (DARWIN MOLECULAR CORP 
;TAB0NE JOHN C (US); MULLIGAN JOHN T (US); 
H) 31 July 1997 

see whole document, esp examples and claims 



1-5 



Form PCT/lSA/21 0 (continuation ot second sheet) (July 1 992) 



page 2 of 2 



INTERNATIONAL SEARCH REPORT 

Information on patent family members 



International Application No 

PCT/GB 98/02048 



Patent document 
cited in search report 



Publication 
date 



Patent family 
member(s) 



Publication 
date 



AT 


159767 


T 


15-11-1997 


AU 


695349 


B 


13-08-1998 


AU 


7269194 


A 


28-02-1995 


CA 


2168010 


A 


09-02-1995 


CN 


1131440 


A 


18-09-1996 


DE 


69406544 


D 


04-12-1997 


DE 


69406544 


T 


26-02-1998 


DK 


711362 


T 


22-12-1997 


EP 


0711362 


A 


15-05-1996 


EP 


0778280 


A 


11-06-1997 


ES 


2108479 


T 


16-12-1997 


FI 


960403 


A 


29-01-1996 


HU 


73802 


A 


30-09-1996 


JP 


9501830 


T 


25-02-1997 


NO 


960370 


A 


28-03-1996 


US 


5770367 


A 


23-06-1998 



WO 9504160 



A 



09-02-1995 



US 5302509 A 12-04-1994 NONE 



W0 9632504 A 17-10-1996 AU 5544696 A 30-10-1996 

CA 2218188 A 17-10-1996 
EP 0830460 A 25-03-1998 



W0 9633205 A 24-10-1996 US 5750341 A 12-05-1998 

AU 5549096 A 07-11-1996 

CA 2218017 A 24-10-1996 

EP 0871646 A 21-10-1998 



W0 9727331 A 31-07-1997 AU 2247397 A 20-08-1997 

EP 0868535 A 07-10-1998 



Form PCT/ISA/21 0 (patent family annex) (July 1 992) 




PATENT COOPERATION TREATY 



rmE 



From the: 

INTERNATIONAL PRELIMINARY EXAMINING AUTHORITY 



To: 

DANIELS, J. 

PAGE WHITE & FARRER 
54 Doughty Street 
LONDON WC1N2LS 
GRANDE BRETAGNE 



A U "'I 1999 

PCT • 



WRITTEN OPINION 



(PCT Rule 66) 



Date of mailing 

(day/month/y&ar) ] 2. 87. 



Applicants or agenfs file reference 
8^QS/JND 


REPLY DUE within 3 month(s) 

from the above date of mailing 


International application No. 
PCT/GB98/02O48 


International filing date (day/month/year) 
13/07/1998 


Priority date (day/month/year) 
11/07/1997 


International Patent Classification (IPC) or both national classification and IPG 
C12Q1/68 


Applicant 

BRAX GROUP LIMITED et al 



1 . This written opinion is the first drawn up by this International Preliminary Examining Authority. 

2. This opinion contains indications relating to the following items: 



I 




II 


□ 


III 


□ 


IV 


□ 


V 




VI 




VII 




VIII 


E#^3 



Reasoned statement under Rule 66.2(a)(ii) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 



The applicant is hereby invited to reply to this opinion. 
When? 



How? 



Also: 



See the time limit indicated above. The applicant may, before the expiration of that time limit, 
request this Authority to grant an extension, see Rule 66.2(d). 

By submitting a written reply, accompanied, where appropriate, by amendments, according to Rule 66.3. 
For the form and the language of the amendments, see Rules 66.8 and 66.9. 

For an additional opportunity to submit amendments, see Rule 66.4. 

For the examiner's obligation to consider amendments and/or arguments, see Rule 66.4 bis. 

For an informal communication with the examiner, see Rule 66.6. 



If no reply Is filed, the international preliminary examination report wilt be established on the basis of this opinion. 
The final date by which the international preliminary 

examination report must be established according to Rule 69.2 is: 11/11/1 999. 



Name and mailing address of the international 
preliminary examining authority: 
European Patent Office 

D-80298 Munich 
Tel. (+49-89) 2399-0 Tx: 523656 epmu d 

Fax: (+49-89) 2399-4465 



Authorized officer / Examiner 
GONCALVES M L F C 



Formalities officer (incl. extension of time limits) 

Danti, B 

Telephone No. (+49-89) 2399 8161 




Form PCT/IPEA/408 (cover sheet) (January 1994) 



• 




WRITTEN OPINION 



International application No. PCT/GB98/02048 



I. Basis of the opinion 

1 . This opinion has been drawn on the basis of (substitute sheets which have been furnished to the receiving Office 
in response to an invitation under Article 14 are referred to in this opinion as "originally filed".): 



Description, pages: 



1-51 



as originally filed 



Claims, No.: 

1-26 



as originally filed 



Drawings, sheets: 



1/14-14/14 



as originally filed 



2. The amendments have resulted in the cancellation of: 

□ the description, pages: 

□ the claims, Nos.: 

□ the drawings, sheets: 

3. This opinion has been established as if (some of) the amendments had not been made, since they have been 
considered to go beyond the disclosure as filed (Rule 70.2(c)): 



4. Additional observations, if necessary: 



V. Reasoned statement under Rule 66.2(a)(ii) with regard to novelty, inventive step or industrial 
applicability; citations and explanations support! 

1. Statement 

Novelty (N) Claims 1, 14-26 (no) 2-1 3 (yes) 

Inventive step (IS) Claims 1 -26 (no) 

Industrial applicability (I A) Claims 1-26 (yes) 



2. Citations and explanations 
see separate sheet 
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and / or 
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VII. Certain defects in the international application 
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VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made: 
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Section V 

1 . Reference is made to the following documents: 
D1: WO-A-95 04160 T"< 
D2: WO-A-96 32504 . T>- 
D3: WO-A-98 1 5652 §^ 
D4: WO-A-97 27331 



2. The subject-matter of claim 1 is a method for characterising DNA, 
comprising steps (i) to (iv). Methods for characterising DNA comprising 
the same method steps as the method of ■clairrrl appear to be*dfsclosed 
in the following prior art documents: 

D1 describes a reagent comprising: (i) an analyte moiety containing two 
analyte residues; and linked to (ii) a tag moiety containing one reporter 
group adapted for detection by mass spectrometry; the reporter group 
designates an analyte residue, and the reporter group at each portion of 
the tag moiety is chosen to designate an analyte residue at a defined 
position on the analyte. Also disclosed in D1 are a library of reagents each 
comprising a different analyte moiety and an assay method comprising: (i) 
providing a target substance; (ii) incubating the target substance with the 
library of reagents under conditions to cause binding of one reagent to the 
target substance; (iii) removing un-bound reagents; (iv) recovering the tag 
moieties of each bound reagent; and (v) analysing the tag moieties as an 
indication of the nature of the analyte moieties bound to the target 
substance; etc. The reagents and libraries can be used for nucleic acid 
sequencing and for detecting particular analytes, e.g. for drug screening. 
The method of D1 has good sensitivity and resolution.. The mass labelling 
combines the advantages of radioactivity and fluorescence (see claims). 
D2 describes a method of sequencing or detecting a target nucleic acid, 
which comprises: (1) providing a set of nucleic acid fragments containing 
a sequence corresponding to a target sequence (optionally by fragmenting 
the target sequence), and hybridizing these to an array (optionally 
attached to a solid support when the fragments are obtained by 
fragmenting the target sequence) of nucleic acid probes (optionally mass 
modified), comprising a double and single stranded portion containing a 
variable sequence to form a target nucleic acid array, then determining the 
sequence of the target nucleic acid by: (a) determining molecular weights 



Form PCT/Separate Sheet/408 (Sheet 1) (EPO-Apri! 1997) 



WRITTEN OPINION 
SEPARATE SHEET 



International application No. PCT/GB98/02048 



for several nucleic acids of the target array; (b) extending and mass 
modifying a strand of the probe using the hybridised fragment as a 
template, and determining molecular weights for several mass modified 
extended nucleic acids by mass spectrometry; (c) extending a strand of 
the mass modified probes using the hybridised fragments as templates, 
and determining molecular weights for a plurality of extended mass 
modified strands; (d) extending a strand of the probe enzymatically using 
the hybridised fragment as a template to create an extended nucleic acid, 
removing alkali cations from the extended nucleic acid and determining 
mol. weight/for several protonated and extended nucleic acids* by mass 
spectrometry; or (e) extending a strand of the probe using the hybridised 
fragment as a template and determining mol. weight, for several nucleic 
acids of the target array; (2) providing a set of partially single stranded 
nucleic acid fragments containing, a target sequence, and hybridizing the 
single stranded portions of the fragments to single stranded portions of a 
set of partially double stranded nucleic acid probes to form a set of 
complexes (optionally containing a variable sequence within the single 
stranded region), determining the sequence of the target nucleic acid by 
ligating a single strand of the fragment to an adjacent single strand of the 
probe, and:(a) extending the unligated strand of the complex by strand- 
displacement polymerisation using the ligated strand as a template; or (b) 
extending the unligated strand of the complex by strand displacement 
polymerisation using the ligated strand as a template, mass modifying the 
extended strand and determining the molecular weights of the extended 
strands by mass spectrometry; or (3) providing a set of nucleic acids 
complementary or homogenous to a sequence of the target, hybridizing 
the set to an array of single stranded nucleic acid probes, comprising a 
constant sequence and a determinable variable sequence and identifying 
the target sequence by determining molecular weights of hybridised 
nucleic acids. The methods are useful in the identification, analysis and 
diagnosis of diseases and disorders, and in determining relationships 
between organisms. The methods of this document, based on positional 
sequencing by hybridisation (PSBH), with their ability to bind and 
discriminate between different sequences with large or small arrays of 
probes is well suited to mass spectrometric analysis, unlike conventional 
methods based on base by base sequence identification (see abstract and 
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claims). 

From the above analysis of the prior art, the subject-matter of claim 1 does 
not appear to be novel (Article 33(2) PCT). 

/ J » 

. ; ■ ■ . ■ 

3. The aditional features added by the dependent claims 14-20 to the 
method of claim 1 refer to standard options in the art and thus cannot 
contribute to the novelty of the claimed subject-matter (Article 33(2) PCT). 
The aditional features introduced by the dependent claims 2-13 to the 
method of claim 1 are not explicitly disclosed in the prior art documents 
and therefore are novel. Because these additiorral features do not appear 
to result in any unexpected efects whatsoever, they can not contribute to 
the inventiveness of the subject-matter of claim 1 (Article 33(3) PCT). 

4. The above comments also apply to the subject-matter of claims 22, 23 and 
24 (referring to preferred embodiments of the method of claim 1) and to 
claims 25 and 26 (referring to the use of probes or oligonucleotide in the 
claimed methods) and to the claims dependent thereon. 

Section VI 

1. The document WO-A-98 1 5652, from the same applicants, discloses the 
nucleic acid sequencing by adaptor ligation. The method for sequencing 
nucleic acids comprises: 

- (a) obtaining a target nucleic acid population comprising nucleic acid 
fragments in which each fragment is present in a unique amount and bears 
a sticky end sequence of predetermined length and unknown sequence; 

- (b) protecting the other end of each fragment, and 

- (c) sequencing each of the fragments by: 

- (i) contacting the fragments with an array of adaptor oligonucleotides 
(ONs) under hybridisation conditions, each adaptor ON bearing a label, a 
sequencing enzyme recognition site, and a known unique base sequence 
of the same predetermined length as the sticky base sequence, the array 
containing all possible base sequences of that predetermined length, 
removing any unhybridised adaptor ON and recording the quantity of any 
hybridised adaptor ON by detection of the label, then repeating the cycle, 

until all of the adaptors in the array have been tested; 

- (ii) contacting the hybridised adaptor ONs with a sequencing enzyme 
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which binds to the recognition site and cuts the fragment to expose a new 
sticky end sequence which is contiguous with or overlaps the previous 
sticky end sequence; 

- (Hi) repeating steps (i) and (ii) for a number of times and determining the 
sequence of the fragment by comparing the quantities recorded for each 
sticky end sequence. The method can be used for the simultaneous 
sequencing of a heterogeneous population of nucleic acid fragments 
without spatially resolving them (see abstract and claims). 
WO-A-97 27331 from the Darwin Molecular Corp., describes a method for 
determining the sequence of a nucleic acid molecule which comprises: (a) 
generating tagged nucleic acid fragments which are complementary to a 
selected target nucleic acid, where a tag is correlative with a particular 
nucleotide and detectable by non-fluorescent spectrometry or poten- 
tiometry; (b) separating the tagged fragments by sequential length; (c) 
cleaving the tags from the tagged fragments; and (d) detecting the tags by 
non-fluorescent spectrometry or potentiometry, and from this determining 
the sequence. 

Also disclosed in this document are compounds of the formula Tms-L-X 
and where: Tms - an organic group detectable by mass spectrometry 
comprises carbon, at least one of H and F, and optional atoms selected 
from O, N, S, P and I; L = an organic group which allows a Tms-containing 
moiety to be cleaved from the remainder of the compound, where the 
Tms-containing moiety comprises a functional group which supports a 
single ionised charge state when the compound is subjected to mass 
spectrometry and is selected from tertiary amine, quaternary amine and 
organic acid; and X= a functional group selected from hydroxyl, amino, 
thio, carboxylic acid, haloalkyl and derivatives of these which activate or 
inhibit the activity of the group toward coupling with other moieties, or is 
a nucleic acid fragment attached to L at other than the 3 1 end of the 
fragment; provided that the compound is not bonded to a solid support 
through X nor has a mass of less than 250 Daltons (see abstract and 
claims). 

The methods and compounds described may be used e.g. for determining 
the sequences of nucleic acid molecules. They may be used for 
determination of multiple nucleic acid sequences simultaneously. The 
methods and compounds of WO-A-97 27331 allow sequencing to be 
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performed with enhanced speed and sensitivity. 

These documents, therefore, would appear to disclose the subject-matter 

of claims 1,21, 22, 23 and 24 of the present application. 

However, it is assumed that the priority date of the present application is 

validly claimed. The present priority date of 1 1/7/97 is, namely, before the 

publication dates of 1 6/4/98(WO-A-98 15652) and 3 1/7/97 (WO- A- 97 

27331). 

Section VII 

1. In order for the application to be self-contain'ed (see further 'the PCT 
Guidelines, Oil, 4.17), the references to patent applications numbers 
should be changed to publication numbers throughout the description. 

Section VIII 

1. Although the groups of claims 1 , 21, 22, 23 and 24 and 25, 26 have been 
drafted as separate independent claims, they appear to relate effectively 
to the same subject-matter and to differ from each other only with regard 
to the definition of the subject-matter for which protection is sought and/or 
in respect of the terminology used for the features of that subject-matter. 
The aforementioned claims therefore lack conciseness. Moreover, lack of 
clarity of the claims as a whole arises, since the plurality of independent 
claims makes it difficult, if not impossible, to determine the matter for which 
protection is sought, and places an undue burden on others seeking to 
establish the extent of the protection. 

Hence, claims 1 , 21, 22, 23 , 24 and 25, 26 do not meet the requirements 
of Article 6 PCT. 

In order to overcome this objection, it would appear appropriate to file an 
amended set of claims defining the relevant subject-matter in terms of a 
single independent claim in each category followed by dependent claims 
covering features which are merely optional (Rule 6.4 PCT). 

2. The applicant attention is called to the fact that there appear to be more 
international applications from the same applicant (for example PCT/US 
98/02044) designating the same states and the claims of those 
applications have the same priority date and relate to the same invention, 
thus the applicant may be required in the national phase to choose which 
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one of those applications he whishes to proceed to grant (see further the 
PCT guidelines, C-IV, 6.3). 
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1 . This international preliminary examination report has been prepared by this International Preliminary Examining Authority 
and is transmitted to the applicant according to Article 36. 



2. This REPORT consists of a total of 7 sheets, including this cover sheet. 

□ This report is also accompanied by ANNEXES, i.e. sheets of the description, claims and/or drawings which have 
been amended and are the basis for this report and/or sheets containing rectifications made before this Authority 
(see Rule 70.1 6 and Section 607 of the Administrative Instructions under the PCT). 

These annexes consist of a total of sheets. 
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Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations suporting such statement 
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Certain defects in the international application 

Certain observations on the international application 
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Date of completion of this report 


Name and mailing address of the international 
preliminary examining authority: 

^ European Patent Office 
mft D-80298 Munich 

Tel. +49 89 2399 - 0 Tx: 523656 epmu d 
Fax: +49 89 2399 - 4465 
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I. Basis of the report 

1 . This report has been drawn on the basis of (substitute sheets which have been furnished to the receiving Office in 
response to an invitation under Article 14 are referred to in this report as "originally filed' and are not annexed to 
the report since they do not contain amendments.): 

Description, pages: 

1 -51 as originally filed 

Claims, No.: 

1 -26 as originally filed 

Drawings, sheets: 

1/14-14/14 as originally filed 

2. The amendments have resulted in the cancellation of: 

□ the description, pages: 

□ the claims, Nos.: 

□ the drawings, sheets: 

3. □ This report has been established as if (some of) the amendments had not been made, since they have been 

considered to go beyond the disclosure as filed (Rule 70.2(c)): 

4. Additional observations, if necessary: 
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V. 



Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial 
applicability; citations and explanations supporting such statement 



Statement 



Novelty (N) 



Yes: 
No: 



Claims 
Claims 



1 



26 



Inventive step (IS) 



Yes: 
No: 



Claims 
Claims 



1 



26 



Industrial applicability (IA) Yes: Claims 1-26 

No: Claims 

2. Citations and explanations 
see separate sheet 

VI. Certain documents cited 

1. Certain published documents (Rule 70.10) 
and / or 

2. Non-written disclosures (Rule 70-9) 
see separate sheet 

VII. Certain defects in the international application 

The following defects in the form or contents of the international application have been noted: 
see separate sheet 

VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made: 

see separate sheet 
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Section V 

1 . The following documents are considered to form the most relevant prior art: 
D1: WO-A-95 04160 
D2: WO-A-98 15652 
D3: WO-A-97 27331 



2, The subject-matter of claim 1 is a method for characterising DNA, comprising steps 
(i) to (iv). Methods for characterising DNA comprising the same method steps as the 
method of claim 1 appear to be disclosed in the following prior art document: 

D1 describes a reagent comprising: (i) an analyte moiety containing two analyte 
residues; and linked to (ii) a tag moiety containing one reporter group adapted for 
detection by mass spectrometry; the reporter group designates an analyte residue, 
and the reporter group at each portion of the tag moiety is chosen to designate an 
analyte residue at a defined position on the analyte. Also disclosed in D1 are a 
library of reagents each comprising a different analyte moiety and an assay method 
comprising: (i) providing a target substance; (ii) incubating the target substance with 
the library of reagents under conditions to cause binding of one reagent to the target 
substance; (iii) removing un-bound reagents; (iv) recovering the tag moieties of each 
bound reagent; and (v) analysing the recovered tag moieties as an indication of 
the nature of the analyte moieties bound to the target substance; etc. The tag moiety 
is suitable for analysis by mass spectrometry after cleavage from the reagent 
(see page 3, lines 6-9). The reagents and libraries can be used for nucleic acid 
sequencing and for detecting particular analytes, e.g. for drug screening. The 
method of D1 has good sensitivity and resolution. The mass labelling combines the 
advantages of radioactivity and fluorescence (see claims). 

From the above analysis of D1 , the subject-matter of claim 1 distinguishes itself from 
the method of this document by including the step of "separaring the fragments on 
the basis of there length" and is therefore novel (Article 33(2) PCT). Because the 
distinguishing feature does not appear to result in any unexpected technical effects 
whatsoever (on the contrary this is an additional step to the method of D1), the 
subject-matter of claim 1 is not considered to be based on an inventive concept 
(Article 33(3) PCT). 

3. The additional features added by the dependent claims 14-20 to the method of 
claim 1 refer to standard options in the art and thus cannot contribute to the 
inventiveness of the claimed subject-matter (Article 33(3) PCT). 
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The additional features introduced by the dependent claims 2-13 to the method of 
claim 1 are not explicitly disclosed in the prior art documents and therefore are novel. 
Because these additional features do not appear to result in any unexpected effects 
whatsoever, they can not contribute to the inventiveness of the subject-matter of 
claim 1 (Article 33(3) PCT). 

4. The above comments also apply to the subject-matter of claims 21, 22, 23 and 24 
(referring to preferred embodiments of the method of claim 1 ) and to claims 25 and 
26 (referring to the use of probes or oligonucleotide in the claimed methods) and to 
the claims dependent thereon (Article 33(3) PCT). 

Section VI 

1 . The document D2, from the same applicants, discloses the nucleic acid sequencing 
by adaptor ligation. The method for sequencing nucleic acids comprises: 

- (a) obtaining a target nucleic acid population comprising nucleic acid fragments in 
which each fragment is present in a unique amount and bears a sticky end sequence 
of predetermined length and unknown sequence; 

- (b) protecting the other end of each fragment, and 

- (c) sequencing each of the fragments by: 

- (i) contacting the fragments with an array of adaptor oligonucleotides (ONs) under 
hybridisation conditions, each adaptor ON bearing a label, a sequencing enzyme 
recognition site, and a known unique base sequence of the same predetermined 
length as the sticky base sequence, the array containing all possible base sequences 
of that predetermined length, removing any unhybridised adaptor ON and recording 
the quantity of any hybridised adaptor ON by detection of the label, then repeating 
the cycle, until all of the adaptors in the array have been tested; 

- (ii) contacting the hybridised adaptor ONs with a sequencing enzyme which binds 
to the recognition site and cuts the fragment to expose a new sticky end sequence 
which is contiguous with or overlaps the previous sticky end sequence; 

- (iii) repeating steps (i) and (ii) for a number of times and determining the sequence 
of the fragment by comparing the quantities recorded for each sticky end sequence. 
The method can be used for the simultaneous sequencing of a heterogeneous 
population of nucleic acid fragments without spatially resolving them (see abstract 
and claims). 

D3 from the Darwin Molecular Corp., describes a method for determining the 
sequence of a nucleic acid molecule which comprises: (a) generating tagged nucleic 
acid fragments which are complementary to a selected target nucleic acid, where a 
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tag is correlative with a particular nucleotide and detectable by non-fluorescent 
spectrometry or potentiometry; (b) separating the tagged fragments by sequential 
length; (c) cleaving the tags from the tagged fragments; and (d) detecting the tags 
by non-fluorescent spectrometry or potentiometry, and from this determining the 
sequence. 

Also disclosed in this document are compounds of the formula Tms-L-X and where: 
Tms = an organic group detectable by mass spectrometry comprises carbon, at least 
one of H and F, and optional atoms selected from O, N, S, P and I; L = an organic 
group which allows a Tms-containing moiety to be cleaved from the remainder of the 
compound, where the Tms-containing moiety comprises a functional group which 
supports a single ionised charge state when the compound is subjected to mass 
spectrometry and is selected from tertiary amine, quaternary amine and organic acid; 
and X= a functional group selected from hydroxy!, amino, thio, carboxylic acid, 
haloalkyl and derivatives of these which activate or inhibit the activity of the group 
toward coupling with other moieties, or is a nucleic acid fragment attached to L at 
other than the 3' end of the fragment; provided that the compound is not bonded to 
a solid support through X nor has a mass of less than 250 Daltons (see abstract and 
claims). 

The methods and compounds described may be used e.g. for determining the 
sequences of nucleic acid molecules. They may be used for determination of 
multiple nucleic acid sequences simultaneously. The methods and compounds of 
WO-A-97 27331 allow sequencing to be performed with enhanced speed and 
sensitivity. 

These documents, therefore, would appear to disclose the subject-matter of claims 
1 , 21 , 22, 23 and 24 of the present application. 

However, it is assumed that the priority date of the present application is validly 
claimed. The present priority date of 1 1/7/97 is, namely, before the publication dates 
of 16/4/98(WO-A-98 15652) and 3 1/7/97( WO-A-97 27331). Documents D2 and D3 
could become relevant for the assessment of novelty in a future regional phase 
before the EPO. 

Section VII 

1 . In order for the application to be self-contained (see further the PCT Guidelines, C-ll, 
4.17), the references to patent applications numbers should have been changed to 
publication numbers throughout the description. 

Section VIII 
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1 . Claims 1, 21, 22, 23 , 24 and 25, 26 do not meet the requirements of Article 6 PCT. 
Because claims 21, 22, 23 and 24 comprise all the features of claim 1 , these claims 
should have been formulated as claims dependent on the latter (Rule 6.4 PCT). 

2. The applicant attention is called to the fact that there appear to be more international 
applications from the same applicant (for example PCT/US 98/02044) designating 
the same states and the claims of those applications have the same priority date 
and relate to the same invention, thus the applicant may be required in the national 
phase to choose which one of those applications he whishes to proceed to grant 
(see further the PCT guidelines, C-IV, 6.3). 
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From the INTERNATIONAL SEARCHING AUTHORITY 


PCT 

NOTIFICATION OF TRANSMITTAL OF 
THE INTERNATIONAL SEARCH REPORT 
OH inc UcoLAHAl ION 

(PCT Rule 44.1) 

*• 


To: 

PAGE WHITE & FARRER 
Attn/ DANIELS, J, 
54 Doughty Street 
LONDON WCIN 2LS 

IINTTFD KINGDOM -^tfS 


Date of mailing 

(dayfrnonth/year) 29/10/1998 


Applicants or agent's file reference 


FOR FURTHER ACTION See paragraphs 1 and 4 below 


International application No. 

PCT/GB 98/02048 


Internationa] filing date 
(day/month/year) 13/07/1998^ 


Applicant 

BRAX GENOMICS LIMITED et al . 



1 . [jf] The applicant is hereby notified that the International Search Report has been established and is transmitted herewith. 
Filing of amendments and statement under Article 19 

The applicant is entitled, if he so wishes, toamend the claims of the International Application (see Rule 46): 

When? The time limit for filing such amendments is normally 2 months from the date of transmittal of the 
International Search Report: however, for more details, see the notes on the accompanying sheet. 



Where? Directly to the 



International Bureau of WIPO 
34. chemin des Coiombettes 
121 1 Geneva 20. Switzerland 
Fasctmile No.: (41-22) 740.14.35 



For more detailed instructions, see the notes on the accompanying sheet. 

2. I | The applicant is hereby notified that no International Search Report will be established and that the declaration under 
' — ' Article 17(2)(a) to that effect is transmitted herewith. 



3. Q With regard to the protest against payment of (an) additional fee(s) under Rule 40.2, the applicant is notified that; 

□ the protest together with the decision thereon has been transmitted to the International Bureau together with the 
applicants's request to forward the texts of boththe protest and the decision thereon to the designated Offices. 

[ | no decision has been made yet on the protest; the applicant will be notified as soon as a decision is made. 

4. Further action(s): The applicant is reminded of the following: 

Shortly after 18 months from the priority date, the international application will be published by the International Bureau. 
If the applicant wishes to avoid or postpone publication, a notice of withdrawal of the international application, or of the 
priority claim, must reach the International Bureau as provided in Rules 90o7s.1 and 90b/s.3, respectively, before the 
completion of the technical preparations for international publication. 

Within 19 months from the priority date, a demand for international preliminary examination must be filed if the applicant 
wishes to postpone the entry into the national phase until 30 months from the priority date (in some Offices even later). 

Within 20 months from the priority date, the applicant must perform the prescribed acts for entry into the national phase 
before all designated Offices which have not been elected in the demand or in a later election within 19 months from the 
priority date or could not be elected because they are not bound by Chapter IL 



Name and mailing address of the International Searching Authority 

European Patent Office, P.B. 5818 Patentlaan 2 
NL-2280HV Rijswijk 

Tel. (+31-70) 340-2040, Tx. 31 651 epo nl, 
Fax: (+31 -70) 340-301 6 



Authorized officer 

Deborah Grandis 
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These Notes are intended to give the basic instructions concerning the filing of amendments under article 1 9. The 
Notes are based on the requirements of the Patent Cooperation Treaty, the Regulations and the Administrative Instructions 
under that Treaty, tn case of discrepancy between these Notes and those requirements, the latter are applicable. For more 
detailed information, see also the PCT Applicant's Guide, a publication of W1PO. 

In these Notes, "Article", "Rule", and "Section" refer to the provisions of the PCT, the PCT Regulations and the PCT 
Administrative Instructions respectively. 



INSTRUCTIONS CONCERNING AMENDMENTS UNDER ARTICLE 19 



The applicant has, after having received the international search report, one opportunity to amend the claims of the 
international application. It should however be emphasized that, since all parts of the international application (claims, 
description and drawings) may be amended during the international preliminary examination procedure, there is usually 
no need to file amendments of the claims under Article 19 except where, e.g. the applicant wants the latter to be published 
for the purposes of provisional protection or has another reason for amending the claims before international pbufication. 
Furthermore, ft should be emphasized that provisional protection is available in some States only. 



What parts of the International application may be amended? 

Under Article 1 9, only the claims may be amended. 

During the international phase , the claims may also be amended (or further amended) under Article 34 before 
the International Preliminary Examining Authority. The description and drawings may only be amended under 
Article 34 before the International Examining Authority. 

Upon entry into the national phase, all parts of the international application may be amended under Article 28 
or, where applicable, Article 41 . 



When? Within 2 months from the date of transmittal of the international search report or 1 6 months from the priority 

date, whichever time limit expires later. It should be noted, however, that the amendments will be considered 
as having been received on time if they are received by the International Bureau after the expiration of the 
applicable time limit but before the completion of the technical preparations for international publication 
(Rule 46.1). 



Where not to file the amendments? 

The amendments may only be filed with the International Bureau and not with the receiving Office or the 
International Searching Authority (Rule 46.2). 

Where a demand for international preliminary examination has been /is filed, see below. 



How? Either by cancelling one or more entire claims, by adding one or more new claims or by amending the text of 

one or more of the claims as filed. 

A replacement sheet must be submitted for each sheet of the claims which, on account of an amendment or 
amendments, differs from the sheet originally filed. 

All the claims appearing on a replacement sheet must be numbered in Arabic numerals. Where a claim is 
cancelled, no renumbering of the other claims is required. In all cases where claims are renumbered, they must 
be renumbered consecutively (Administrative Instructions, Section 205(b)). 

The amendments must be made In the language In which the International application is to be published. 



What documents must/may accompany the amendments? 
. Letter (Section 205(b)): 

The amendments must be submitted with a letter. 

The letter will not be published with the international application and the amended claims. It should not be 
confused with the "Statement under Article 1 9(1)" (see below, under "Statement under Article 19(1)*). 

The letter must be In English or French, at the choice of the applicant However, If the language of the 
International application la English, the letter must be In English; if the language of the international application 
Is French, the letter must be In French. 
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The fetter must indicate the differences between the claims as filed and the claims a3 amended. It must, in 
particular, indicate, in connection with each claim appearing in the international application (it being understood 
that identical indications concerning several claims may be grouped), whether 

(i) the claim is unchanged; 

(ii) the claim is cancelled; 

(iii) the claim is new; 

(iv) the claim replaces one or more claims as filed; 

(v) the claim is the result of the division of a claim as filed. 



The following examples Illustrate the manner in which amendments must be explained fn the 
accompanying letter: 

1. (Where originally there were 48 claims and after amendment of some claims there are 51 J: 
"Claims 1 to 29, 31 , 32, 34. 35, 37 to 48 replaced by amended claims bearing the same numbers; 
claims 30, 33 and 36 unchanged; new claims 49 to 51 added." 

2. [Where originally there were 15 claims and after amendment of alfd aims there are 11): - *.' 
"Claims 1 to 15 replaced by amended claims 1 to 11* 

3. [Where originally there were 14 claims and the amendments consist in cancelling some claims and in adding 
new claims]: 

"Claims 1 to 6 and 1 4 unchanged; claims 7 to 1 3 cancelled; new claims 1 5, 16 and 17 added." or 
"Claims 7 to 13 cancelled; new claims 1 5, 16 and 17 added; all other claims unchanged." 

4. (Where various kinds of amendments are madej: 

"Claims 1*10 unchanged; claims 1 1 to 1 3, 18 and 19 cancelled; claims 1 4, 1 5 and 16 replaced by amended 
claim 1 4; claim 17 subdivided into amended claims 15, 1 6 and 1 7; new claims 20 and 21 added.* 



"Statement under article 19(1)" (Rule 46.4) 

The amendments may be accompanied by a statement explaining the amendments and indicating any impact 
that such amendments might have on the description and the drawings (which cannot be amended under 
Article 19(1)). 

The statement wilt be published with the international application and the amended claims. 
It must be In the language in which the international appplication Is to be published. 

It must be brief, not exceeding 500 words if in English or if translated into English. 

It should not be confused with and does not replace the letter indicating the differences between the claims 
as filed and as amended. It must be filed on a separate sheet and must be identified as such by a heading, 
preferably by using the words "Statement under Article 19(1)." 

It may not contain any disparaging comments on the international search report or the relevance of citations 
contained in that report. Reference to citations, relevant to a given claim, contained in the international search 
report may be made only in connection with an amendment of that claim. 



Consequence If a demand for International preliminary examination has already been filed 

if, at the time of filing any amendments under Article 1 9, a demand for international preliminary examination 
has already been submitted, the applicant must preferably, at the same time of filing the amendments with the 
International Bureau, also file a copy of such amendments with the International. Preliminary Examining 
Authority (see Rule 62.2(a), first sentence). 



Consequence with regard to translation of the International application for entry Into the national phase 

The applicant's attention is drawn to the fact that, where upon entry into the national phase, a translation of the 
claims as amended under Article 19 may have to be furnished to the designated/elected Offices, instead of, or 
in addition to, the translation of the claims as filed. 

For further details on the requirements of each designated/elected Office, see Volume II of the PCT Applicant's 
Guide. 



Notes to Form PCT/ISA/220 (second sheet) (January 1994) 



W PATENT COOPERATION TR^jP* 

PCT 



INTERNATIONAL SEARCH REPORT 

(PCT Article 18 and Rules 43 and 44) 



Applicant's or agenf s file reference . 

88208/JND 


FOR FURTHER see Notification of Transmittal of International Search Report 

(Form PCT/iSA/220) as well as, where applicable, item 5 below. 

ACTION 


International application No. 

PCT/GB 98/02048 


international filing date (day/month/year) 

13/07/1998 


(Earliest) Priority Date (ctay/month/year) 

11/07/1997 


Applicant 

BRAX GENOMICS LIMITED et al. 



This International Search Report has been prepared by this International Searching Authority and is transmitted to the applicant 
according to Article 18. A copy is being transmitted to the International Bureau. ■ 



This international Search Report consists of a total of 



sheets. 



fy~| it is also accompanied by a copy of each prior art document cited in this report. 



1. Certain claims were found unsearchable (see Box i). 

2. Q Unity of invention is lacking(see Box II). 



3. The international application contains disclosure of a nucleotide and/or amino acid sequence listing and the 

international search was carried out on the basis of the sequence listing 

| | filed with the international application. 

[~[ furnished by the applicant separately from the international application, 

[~~] but not accompanied by a statement to the effect that it did not include 

matter going beyond the disclosure in the international application as filed. 

| [ Transcribed by this Authority 



4. With regard to the title, 



Y] the text is approved as submitted by the applicant 

| the text has been established by this Authority to read as follows: 



5. With regard to the abstract, 

the text is approved as submitted by the applicant 

| [ the text has been established, according to Rule 38.2(b), bythis Authority as it appears in 
Box III. The applicant may, within one month from the date of mailing of this International 
Search Report, submit comments to this Authority, 



6. The figure of the drawings to be published with the abstract is: 

Figure No. 4 Q as suggested by the applicant. Q None of the figures. 

"X~[ because the applicant failed to suggest a figure. 
"*~| because this figure better characterizes the invention. 
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A. CLASSIFICATION OF SUBJECT MATTER 

IPC 6 C12Q1/68 



According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 6 C12Q 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



A 
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the report since they do not contain amendments.): 

Description, pages: 

1-51 as originally filed 

Claims, No.: 

1-26 as originally filed 

Drawings, sheets: 

1/14-14/14 as originally filed 
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□ the description, pages: 

□ the claims, Nos.: 

□ the drawings, sheets: 

3. □ This report has been established as if (some of) the amendments had not been made, since they have been 

considered to go beyond the disclosure as filed (Rule 70.2(c)): 
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1. Statement 



Novelty (N) 


Yes: 


Claims 


'1-26 




No: 


Claims 




Inventive step (IS) 


Yes: 


Claims 






No: 


Claims 


1-26 


Industrial applicability (IA) 


Yes: 


Claims 
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No: 


Claims 





2. Citations and explanations 
see separate sheet 

VI. Certain documents cited 

1. Certain published documents (Rule 70.10) 

and / or - 

2. Non-written disclosures (Rule 70.9) 
see separate sheet 

VII. Certain defects in the international application 

The following defects in the form or contents of the international application have been noted: 
see separate sheet 

VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made: 

see separate sheet 
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Section V 

1 . The following documents are considered to form the most relevant prior art: 
D1: WO-A-95 04160 

D2: WO-A-98 15652 
D3: WO-A-97 27331 

2. The subject-matter of claim 1 is a method for characterising DNA, comprising steps 
(i) to (iv). Methods for characterising DNA comprising the same method steps as the 
method of claim 1 appear to be disclosed in the following prior art.document: 

D1 describes a reagent comprising: (i) an analyte moiety containing two analyte 
residues; and linked to (ii) a tag moiety containing one reporter group adapted for 
detection by mass spectrometry; the reporter group designates an analyte residue, 
and the reporter group at each portion of the tag moiety is chosen to designate an 
analyte residue at a defined position on the analyte. Also disclosed in D1 are a 
library of reagents each comprising a different analyte moiety and an assay method 
comprising: (i) providing a target substance; (ii) incubating the target substance with 
the library of reagents under conditions to cause binding of one reagent to the target 
substance; (iii) removing un-bound reagents; (iv) recovering the tag moieties of each 
bound reagent; and (v) analysing the recovered tag moieties as an indication of 
the nature of the analyte moieties bound to the target substance; etc. The tag moiety 
is suitable for analysis by mass spectrometry after cleavage from the reagent 
(see page 3, lines 6-9). The reagents and libraries can be used for nucleic acid 
sequencing and for detecting particular analytes, e.g. for drug screening. The 
method of D1 has good sensitivity and resolution. The mass labelling combines the 
advantages of radioactivity and fluorescence (see claims). 

From the above analysis of D1 , the subject-matter of claim 1 distinguishes itself from 
the method of this document by including the step of "separating the fragments on 
the basis of there length" and is therefore novel (Article 33(2) PCT). Because the 
distinguishing feature does not appear to result in any unexpected technical effects 
whatsoever (on the contrary this is an additional step to the method of D1), the 
subject-matter of claim 1 is not considered to be based on an inventive concept 
(Article 33(3) PCT). 

3. The additional features added by the dependent claims 1 4-20 to the method of 
claim 1 refer to standard options in the art and thus cannot contribute to the 
inventiveness of the claimed subject-matter (Article 33(3) PCT). 
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The additional features introduced by the dependent claims 2-1 3 to the method of 
claim 1 are not explicitly disclosed in the prior art documents and therefore are novel. 
Because these additional features do not appear to result in any unexpected effects 
whatsoever, they can not contribute to the inventiveness of the subject-matter of 
claim 1 (Article 33(3) PCT). 

4. The above comments also apply to the subject-matter of claims 21, 22, 23 and 24 
(referring to preferred embodiments of the method of claim 1) and to claims 25 and 
26 (referring to the use of probes or oligonucleotide in the claimed methods) and to 
the claims dependent thereon (Article 33(3) PCT). ~ 

Section VI 

1. The document D2, from the same applicants, discloses the nucleic acid sequencing 
by adaptor ligation. The method for sequencing nucleic acids comprises: 

- (a) obtaining a target nucleic acid population comprising nucleic acid fragments in 
which each fragment is present in a unique amount and bears a sticky end sequence 
of predetermined length and unknown sequence; 

- (b) protecting the other end of each fragment, and 

- (c) sequencing each of the fragments by: 

- (i) contacting the fragments with an array of adaptor oligonucleotides (ONs) under 
hybridisation conditions, each adaptor ON bearing a label, a sequencing enzyme 
recognition site, and a known unique base sequence of the same predetermined 
length as the sticky base sequence, the array containing all possible base sequences 
of that predetermined length, removing any unhybridised adaptor ON and recording 
the quantity of any hybridised adaptor ON by detection of the label, then repeating 
the cycle, until all of the adaptors in the array have been tested; 

- (ii) contacting the hybridised adaptor ONs with a sequencing enzyme which binds 
to the recognition site and cuts the fragment to expose a new sticky end sequence 
which is contiguous with or overlaps the previous sticky end sequence; 

- (iii) repeating steps (i) and (ii) for a number of times and determining the sequence 
of the fragment by comparing the quantities recorded for each sticky end sequence. 
The method can be used for the simultaneous sequencing of a heterogeneous 
population of nucleic acid fragments without spatially resolving them (see abstract 
and claims). 

D3 from the Darwin Molecular Corp., describes a method for determining the 
sequence of a nucleic acid molecule which comprises: (a) generating tagged nucleic 
acid fragments which are complementary to a selected target nucleic acid, where a 
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tag is correlative with a particular nucleotide and detectable by non-fluorescent 
spectrometry or potentiometry; (b) separating the tagged fragments by sequential 
length; (c) cleaving the tags from the tagged fragments; and (d) detecting the tags 
by non-fluorescent spectrometry or potentiometry, and from this determining the 
sequence. 

Also disclosed in this document are compounds of the formula Tms-L-X and where: 
Tms = an organic group detectable by mass spectrometry comprises carbon, at least 
one of H and F, and optional atoms selected from O, N, S, P and I; L = an organic 
group which allows a Tms-containing moiety to be cleaved from the remainder of the 
compound, where the Tms-containing moiety comprises a functional group which 
supports a single ionised charge state when the compound is subjected to mass 
spectrometry and is selected from tertiary amine, quaternary amine and organic acid; 
and X= a functional group selected from hydroxyl, amino, thio, carboxylic acid, 
haloalkyl and derivatives of these which activate or inhibit the activity of the group 
toward coupling with other moieties, or is a nucleic acid fragment attached to L at 
other than the 3' end of the fragment; provided that the compound is not bonded to 
a solid support through X nor has a mass of less than 250 Daltons (see abstract and 
claims). 

The methods and compounds described may be used e.g. for determining the 
sequences of nucleic acid molecules. They may be used for determination of 
multiple nucleic acid sequences simultaneously. The methods and compounds of 
WO-A-97 27331 allow sequencing to be performed with enhanced speed and 
sensitivity. 

These documents, therefore, would appear to disclose the subject-matter of claims 
1 , 21, 22, 23 and 24 of the present application. 

However, it is assumed that the priority date of the present application is validly 
claimed. The present priority date of 1 1/7/97 is, namely, before the publication dates 
of 1 6/4/98(WO- A-98 15652) and 3 1/7/97( WO-A-97 27331). Documents D2 and D3 
could become relevant for the assessment of novelty in a future regional phase 
before the EPO. 

Section VII 

1. In order for the application to be self-contained (see further the PCT Guidelines, C-ll, 
4.17), the references to patent applications numbers should have been changed to 
publication numbers throughout the description. 

Section VIII 
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1. Claims 1, 21, 22, 23 , 24 and 25, 26 do not meet the requirements of Article 6 PCT. 
Because claims 21, 22, 23 and 24 comprise all the features of claim 1 , these claims 
should have been formulated as claims dependent on the latter (Rule 6.4 PCT). 

2. The applicant attention is called to the fact that there appear to be more international 
applications from the same applicant (for example PCT/US 98/02044) designating 
the same states and the claims of those applications have the same priority date 
and relate to the same invention, thus the applicant may be required in the national 
phase to choose which one of those applications he whishes to proceed to grant 
(see further the PCT guidelines, C-IV, 6.3). ~ 
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(57) Abstract 

A method for characterising DNA, which comprises: (i) providing a population of DNA fragments, each fragment having cleavably 
attached thereto a mass label for identifying a feature of that fragment; (ii) separating the fragments on the basis of their length; (iii) cleaving 
each fragment in a mass spectrometer to release its mass label; and (iv) determining each mass label by mass spectroscopy to relate the 
feature of each fragment to the length of the fragment. 
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Characterising Nucleic Acids 

The present invention relates to a method for characterising DNA, 
especially to obtain sequence information. 

Conventional DNA sequencing uses a DNA polymerase to add numerous 
dideoxy/deoxynucleotides to an oligonucleotide primer, annealed 
to a single stranded DNA template, in a template specific manner. 
Random termination of this process is achieved when terminating 
nucleotides , i.e. the dideoxynucleotides , are incorporated in 
the template complement. In this way, a series of fragments is 
produced containing all possible lengths of the template 
complement. A 'DNA ladder' is produced when the randomly 
terminated strands are separated on a denaturing polyacrylamide 
gel. Sequence information is gathered, following polyacrylamide 
gel electrophoresis, by detecting the * DNA ladder' either through 
incorporating a radioactive isotope or fluorescent label into one 
of the nucleotides or the primer used in the reaction. A 
particular drawback with this technology is its dependence on 
conventional gel electrophoresis, to separate the DNA fragments 
in order to deduce sequence information, as this is a slow 
process taking up to nine hours to complete . 

W097/27331 describes methods for determining nucleic acid 
sequences in which mass spectrometry is used to detect tags from 
tagged nucleic acid fragments complementary to a selected target 
nucleic acid molecule. W097/27331 proposes cleaving tagged 
fragments by photolysis followed by subsequent mass spectroscopic 
determination of purified tags. This method suffers from a 
number of drawbacks, including the need for expensive equipment 
to perform the photolysis step. 

The present invention aims to overcome the drawbacks of the prior 
art . 

Summary of the Invention 

The present invention provides a method for characterising DNA, 
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which comprises: 

(i) providing a population of DNA fragments, each fragment 
having cleavably attached thereto a mass label for identifying 
a feature of that fragment; 

(ii) separating the fragments on the basis of their length; 

(iii) cleaving each fragment in a mass spectrometer to release its 
mass label ; and 

(iv) determining each mass label by mass spectroscopy to 
relate the feature of each fragment to the length of the 
fragment . 

By cleaving each fragment within the spectrometer the present 
invention possesses advantages over the use of cleavage outside 
the mass spectrometer, for example by chemical or photolytic 
cleavage* By designing mass labels which cleave within the mass 
spectrometer the need for expensive laser equipment or an 
additional cleavage chamber or interface between separating the 
fragments and determining the mass label to mass spectrometry is 
avoided. Cleavage in the mass spectrometer may take place using 
the ionisation process to induce fragmentation of each mass 
label. This is described in further detail below. For example, 
each mass label may be cleavably attached to its respective 
fragment by a linker which is cleavable in the mass spectrometer. 

The method of characterising DNA according to the present 
invention finds particular application in DNA sequencing. In one 
aspect, the method further comprises: 

(a) providing at least one DNA single -stranded template 
primed with a primer; and 

(b) generating the population of DNA fragments from the at 
least one template, wherein the population comprises at least one 
series of DNA fragments, the or each series containing all 
possible lengths of a second strand of DNA complementary to the 
or each template; 

wherein the feature of each fragment determined by each mass 
label relates to a nucleotide or nucleotide sequence at one end 
of each fragment, so that each nucleotide is related to a 
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position in the template associated with the mass label so as to 
deduce the sequence of the or each template - 

In a preferred arrangement, the step of separating the fragments 
is effected by capillary electrophoresis „ Capillary 
electrophoresis systems are amenable to microf abrication which 
avoid many of the problems associated with conventional gel 
electrophoresis, most notable of which is the time required for 
separation of bands. In a micro-fabricated capillary, this can 
be of the order of minutes rather than hours. Further, because 
the separation medium can be liquid, loading of capillaries can 
be easily automated. 

The present invention can therefore avoid the limiting step in 
conventional sequencing techniques of resolution of the fragment 
population generated by the polymerisation in the presence of 
blocking nucleotides. The polymerase reaction is simple and 
relatively fast and can be readily performed in parallel. This 
is advantageous as it increases throughput. The rapidity, in 
comparison with the iterative approaches described in GB9620769.1 
and GB9700760.1, reduces the time available for secondary 
structure formation in the templates which can impede chemical 
and biological steps in the sequencing process . 

Thus, by removing the slab gel electrophoretic steps of the prior 
art, the overall speed of cDNA sequencing can be increased 
significantly by using capillary electrophoresis and mass 
spectrometry to derive sequence information. 

In one arrangement, the series of DNA fragments is provided by 
contacting the template in the presence of DNA polymerase with 
a mixture of nucleotides sufficient for hybridising to the 
template for forming a second strand of DNA complementary to the 
template, wherein the mixture further comprises a set of four 
probes containing all four nucleotides for hybridising to the 
template in which the nucleotide of each probe comprises a 
modified nucleotide which is capable of polymerising to the 
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second strand of DNA but blocked to prevent further 
polymerisation thereto and which is cleavably attached to the 
mass label, which mass label is uniquely resolvable in mass 
spectrometry for identifying the modified nucleotide, and wherein 
each fragment is terminated with one of the probes. 

In a further arrangement, at least one template is a plurality 
of templates and the series of DNA fragments is provided by 
contacting each template in a separate reaction zone in the 
presence of DNA polymerase with a mixture of nucleotides 
sufficient for hybridising to the template for forming a second 
strand of DNA complementary to the template, wherein the mixture 
further comprises a set of four probes containing all four 
nucleotides for hybridising to the template in which the 
nucleotide of each probe comprises a modified nucleotide which 
is capable of polymerising to the second strand of DNA but 
blocked to prevent further polymerisation thereto and which is 
cleavably attached to the mass label, which mass label is 
uniquely resolvable in mass spectrometry for identifying the 
modified nucleotide, wherein each fragment is terminated with one 
of the probes, and wherein each set of mass labels from each set 
of four probes associated with each reaction zone is different 
from the other sets of mass labels; and the fragments are pooled 
before step (ii) . 

The present invention is a development of earlier patent 
applications on primer extension sequencing (GB 9620769.1 and GB 
9700760.1). The earlier inventions describe methods for 
sequencing where a template sequence is determined in an 
iterative process. The processes described in these earlier 
patents are potentially limited in the read- length achievable by 
the efficiency of the chemical and biological reactions occuring 
in each iteration of the process. 

The sequencing process of the present invention allows one to 
sequence large numbers of Sanger ladder populations, generated 
in parallel, simultaneously in a fully automated process. This 
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invention thus includes a method for automating the preparation 
of a large number of Sanger sequencing reactions allowing a large 
number of templates to be sequenced in parallel. 

In a preferred arrangement, the primed DNA may be immobilised on 
a solid phase support such as a bead or the well of a microtitre 
plate . 

The mixture of nucleotides used in the method typically comprises 
ATP, TTP, CTP and GTP although their analogues may also be used. 
All four nucleotides are usually required to ensure that they are 
sufficient for forming a second strand of DNA to hybridise to the 
template * 

Preferably, the modified nucleotides are dideoxy- or 
deoxynucleotides which may be added in a concentration suitable 
to ensure random incorporation into the polymerase reaction. In 
this way, the DNA ladder of fragments may be produced. 

The mass labels may be uniquely resolvable in mass spectrometry 
in the sense that their mass/charge ratio and/or fragmentation 
patterns characterise each mass label uniquely so that it may be 
assigned to a corresponding probe and thereby a corresponding 
modified nucleotide so that the nucleotide present in the target 
template sequence may be deduced because it hybridises with the 
modified nucleotide of the probe. For a simple non-multiplexed 
method for sequencing DNA it is sufficient to have only four 
probes, each with its respective modified nucleotide. In other 
words, four separate mass labels would be required in this simple 
method. In a multiplexing method, however, multiples of four 
probes are typically required, the mass labels of each group of 
four being different from one another. Each group of four mass 
labels must also be different from the other groups of mass 
labels so that it is clear which mass label is associated with 
which reaction zone. 

The reaction zones of the present invention are typically 
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separate containers and may conveniently be microtitre wells in 
a microtitre plate. 

In one arrangement according to the present invention a single 
primed DNA single- stranded template is present in each reaction 
zone- However, it is possible to have up to four different 
templates in each separate reaction zone although one or two 
templates are preferred. Where more than one template is used 
in each reaction zone the same four probes are used. 
Statistically it is possible still to assign a unique sequence 
to each template. Preferably, the relative concentrations of 
each template are known in this method. This is discussed in 
further detail below. 

In a further arrangement, the at least one template is a 
plurality of templates and the series of DNA fragments is 
provided by contacting each template in a separate reaction zone 
in the presence of DNA polymerase with a mixture of nucleotides 
sufficient for hybridising to the template for forming a second 
strand of DNA complementary to the template, wherein the mixture 
further comprises a probe . containing only one of the four 
nucleotides for hybridising to the template, the nucleotide of 
which probe comprises a modified nucleotide which is capable of 
polymerising to the second strand of DNA but blocked to prevent 
further polymerisation thereto, wherein each fragment is 
terminated with the probe and wherein either the primer or the 
modified nucleotide of the probe is cleavably attached to the 
mass label, which mass label is associated with the reaction zone 
and uniquely resolvable in mass spectrometry from the mass label 
in the other reaction zones for identifying the modified 
nucleotide used in the reaction zone; and the fragments are 
pooled before step (ii) . 

In accordance with this method each separate reaction zone 
contains only one of the four possible modified nucleotides 
attached to a probe to terminate the polymerisation process. 
Thus, in each individual reaction zone a DNA ladder corresponding 
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to a template sequence terminating in either As, Ts, Gs or Cs 
would be formed. By repeating the method for each of the four 
nucleotides in separate reaction zones the full sequence of the 
template may be deduced. In this method either the probe or the 
primer may have cleavably attached thereto its corresponding mass 
label . 

In a further arrangement, the at least one template is a 
plurality of templates and the series of DNA fragments is 
provided by contacting the plurality of templates in each of four 
separate reaction zones in the presence of DNA polymerase with 
a mixture of nucleotides sufficient for hybridising to the 
template for forming a second strand of DNA complementary to the 
template, wherein the mixture further comprises a probe 
containing in each of the reaction zones only one of the four 
nucleotides for hybridising to the template, the nucleotide of 
which probe comprises a modified nucleotide which is capable of 
polymerising to the second strand of DNA but blocked to prevent 
further polymerisation thereto, wherein each fragment is 
terminated with the probe and wherein the primer is cleavably 
attached to the mass label, which mass label is associated with 
the primer and uniquely resolvable in mass spectrometry from the 
mass labels associated with the other primers used in the 
reaction zone; and wherein each nucleotide from its corresponding 
reaction zone is related to its position in the template. 

In a further arrangement, the at least one template is four sets 
of DNA single-stranded templates, each set comprising an 
identical plurality of DNA single- stranded templates and the 
series of DNA fragments is provided by contacting each set in a 
separate reaction zone in the presence of DNA polymerase with a 
mixture of nucleotides sufficient for hybridising to the 
templates for forming a second strand of DNA complementary 
thereto, wherein the mixture further comprises a probe containing 
in each of the reaction zones only one of the four nucleotides 
for hybridising to the template, the nucleotide of which probe 
comprises a modified nucleotide which is capable of polymerising 
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to the second strand of DNA but blocked to prevent further 
polymerisation thereto, wherein each fragment is terminated with 
the probe and wherein each of the templates of the four sets is 
primed with a primer to which the mass label is cleavably 
attached, which mass label which uniquely resolvable in mass 
spectrometry from the mass labels corresponding to the other 
templates and which is relatable to its respective template and 
its respective reaction zone, wherein the fragments are pooled 
before step (ii) , and each nucleotide from its corresponding 
reaction zone is related to its position in the template. 

The plurality of single- stranded templates may be primed by 
hybridising to a known sub- sequence common to each of the 
templates an array of primers each comprising a base sequence 
containing a common sequence complementary to the known sub- 
sequence and a variable sequence of common length, usually in the 
range 2 to 6 , preferably in the range 2 to 4, more preferably 3. 
The array contains all possible sequences of that common length 
and the mass label cleavably attached to each primer is relatable 
to the variable sequence. The variable sequence is relatable to 
the particular template to be sequenced. 

In a further arrangement, the at least one template is a 
plurality of templates and the series of DNA fragments is 
provided by contacting each set of templates in a separate 
reaction zone in the presence of DNA polymerase with a mixture 
of nucleotides sufficient for hybridising to the templates for 
forming a second strand of DNA complementary thereto, wherein the 
mixture further comprises a set of four probes containing all 
four nucleotides for hybridising to the template in which the 
nucleotide of each probe comprises a modified nucleotide which 
is capable of polymerising to the second strand of DNA but 
blocked to prevent further polymerisation thereto and which is 
cleavably attached to the mass label, which mass label is 
uniquely resolvable in mass spectrometry for identifying the 
modified nucleotide, wherein each fragment is terminated with one 
of the probes, and wherein each set of mass labels from each set 
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of four probes associated with each reaction zone is different 
from the other sets of mass labels and, before step (ii) , the 
fragments are pooled and the pooled fragments are sorted 
according to a sub-sequence having a common length of 3 to 5 
bases adjacent to the primer to form an array of groups of sorted 
fragments, wherein each group is spatially separated from the 
other groups . 

In a preferred arrangement, the step of sorting the pooled 
fragments comprises contacting the fragments with an array of 
spatially separate oligonucleotides each comprising a base 
sequence containing a common sequence complementary to the primer 
sequence of the fragments and a variable sequence of the common 
length, which array contains all possible variable sequences of 
the common length. The common length is preferably 4, in which 
case an array of 256 spatially separate oligonucleotides is 
required * The array of spatially separate oligonucleotides is 
conveniently a hybridisation array and may comprise a 
hybridisation chip. 

In an alternative arrangement, the series of DNA fragments is 
provided by 

(i) contacting the template in the presence of DNA polymerase 
with a mixture of nucleotides sufficient for hybridising to the 
template for forming a second strand of DNA complementary to the 
template, wherein the mixture further comprises a set of four 
probes containing all four nucleotides for hybridising to the 
templates in which the nucleotide of each probe comprises a 
modified nucleotide which is capable of polymerising to the 
second strand of DNA but reversibly blocked to prevent further 
polymerisation thereto, wherein the step of contacting forms a 
series of templates containing all possible lengths of the second 
strand of DNA, each second strand terminated with one of the 
probes ; 

(ii) removing unpolymerised nucleotides; 

(iii) unblocking the modified nucleotides; and 

(iv) contacting the series of templates with an array of 
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oligonucleotide probes, wherein each oligonucleotide probe has 
a nucleotide sequence of common length 2 to 6 , all combinations 
of sequences are present in the array, and wherein each probe is 
cleavably attached to the mass label, which mass label is 
uniquely resolvable in mass spectrometry for identifying the 
nucleotide sequence . 

In accordance with this arrangement, the series of templates 
forms a DNA ladder to which the oligonucleotide probes may be 
ligated. This method may be extended in a further arrangement 
in which the at least one template is a plurality of primed DNA 
single- stranded templates, each at a unique concentration, and 
the series of DNA fragments is provided by 

(i) contacting the templates in the presence of DNA 
polymerase with a mixture of nucleotides sufficient for 
hybridising to the template for forming a second strand of DNA 
complementary to the templates, wherein the mixture further 
comprises a set of four probes containing all four nucleotides 
for hybridising to the templates in which the nucleotide of each 
probe comprises a modified nucleotide which is capable of 
polymerising to the second strand of DNA but reversibly blocked 
to prevent further polymerisation thereto, wherein the step of 
contacting forms a series of templates containing all possible 
lengths of the second strand of DNA, each second strand 
terminated with one of the probes; 

(ii) removing unpolymerised nucleotides; 

(iii) unblocking the modified nucleotides; and 

(iv) contacting the series of templates with an array of 
oligonucleotide probes, wherein each oligonucleotide probe has 
a nucleotide sequence of common length 2 to 6 , all combinations 
of sequences are present in the array, and wherein each probe is 
cleavably attached to the mass label, which mass label is 
uniquely resolvable in mass spectrometry for identifying the 
nucleotide sequence * 

The nucleotide probes used to extend the DNA ladder hybridised 
to the series of templates preferably has a sequence of common 
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length 3 to 5, most preferably 4 nucleotides. In this way, 256 
sequences would be present in the array. 

The mass label to which each probe is cleavably attached in these 
alternative methods need not be cleaved in a mass spectrometer 
and can be cleaved outside the mass spectrometer, for example by 
photocleavage or chemical cleavage. 

In a further alternative method, the series of DNA fragments is 
provided by contacting the template in the presence of DNA ligase 
with a mixture of oligonucleotides sufficient for hybridising to 
the template for forming a second strand of DNA complementary to 
the template, the oligonucleotides each having a common length 
in the range 2 to 6, wherein the mixture further comprises a set 
of probes containing all possible oligonucleotides of the common 
length 1 for hybridising to the templates in which the 
oligonucleotide of each probe comprises a modified 
oligonucleotide which is capable of ligating to the second strand 
of DNA but blocked to prevent further ligation thereto and which 
is cleavably attached to the mass label, which mass label is 
uniquely resolvable in mass spectrometry for identifying the 
modified oligonucleotide, and the series of fragments contains 
all possible lengths of the second strand of DNA of integer 
multiples of 1, in which each fragment is terminated with one of 
the probes . 

The at least one template may be a plurality of primed DNA 
single- stranded templates each at a unique concentration. In 
this particular embodiment, the series of DNA fragments is 
provided by contacting the templates in the presence of DNA 
ligase with a mixture of oligonucleotides sufficient for 
hybridising to the templates for forming a second strand of DNA 
complementary to the templates, the oligonucleotides each having 
a common length in the range 2 to 6 , wherein the mixture further 
comprises a set of probes containing all possible 
oligonucleotides of the common length 1 for hybridising to the 
templates in which the oligonucleotide of each probe comprises 
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a modified oligonucleotide which is capable of ligating to the 
second strand of DNA but blocked to prevent further ligation 
thereto and which is cleavably attached to the mass label, which 
mass label is uniquely resolvable in mass spectrometry for 
identifying the modified oligonucleotide, and the series of 
fragments contains all possible lengths of the second strand of 
DNA of integer multiples of 1, in which each fragment is 
terminated with one of the probes . 

L is preferably 3 to 5, most preferably 4, and once again, the 
mass labels to which the oligonucleotide probes are cleavably 
attached need not be cleaved within the mass spectrometer and 
could be cleaved outside, for example by photolysis or chemical 
cleavage. 

In these further alternative arrangements, instead of producing 
a DNA ladder with a spacing of one base, a DNA ligase is used and 
a spacing of 2 to 6 bases is used. This is therefore a ligase 
chain reaction. 

In a further aspect the present invention provides a set of 
nucleotide probes for use in a method of sequencing DNA from a 
primed DNA single- stranded template, which set of probes contains 
all four nucleotides for hybridising to the template, wherein the 
nucleotide of each probe comprises a modified nucleotide which 
is capable of polymerising to a second strand of DNA 
complementary to the template but blocked to prevent further 
polymerisation thereto, which modified nucleotide is cleavably 
attached to a mass label for identifying the modified nucleotide, 
and wherein each mass label when released from the probe is 
uniquely resolvable in relation to every other mass label in the 
set by mass spectrometry and is relatable to its corresponding 
modified nucleotide. 

The set of probes may comprise a plurality of sub- sets of probes , 
each sub- set containing all four nucleotides for hybridising- to 
the template . 
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In a further aspect the present invention provides a set of 
oligonucleotide primers, each of which comprises a mass label 
cleavably attached to an oligonucleotide primer base sequence for 
hybridising to a DNA single -stranded template to form a primed 
template, wherein each mass label of the set, when released from 
the primer, is uniquely resolvable in relation to every other 
mass label in the set by mass spectrometry and is relatable to 
the oligonucleotide primer base sequence. 

The mass label of each probe or primer may be attached to the 
modified nucleotide or primer by a cleavable linker which may be 
cleaved under any appropriate cleavage conditions such as 
photocleavage conditions or chemical cleavage conditions. The 
mass labelled probes or primers may be made in accordance with 
any standard methodology including the methodology disclosed in 
PCT/GB9 8/00127 of 15th January 1998 filed by the present 
applicants . 

The present invention will be described in further detail by way 
of example only with reference to the accompanying drawings, in 
which : 

FIGURE 1 shows how PGR primers for each cloned fragment may be 
used in amplification; 

FIGURE 2 shows the use of biotinylated PCR primers in fragment 
amplification; 

FIGURE 3 shows the production of an immobilised single stranded 
template from double stranded PCR product; 

FIGURES 4a and 4b show diagrammatically the method of a preferred 
embodiment of the present invention; 

FIGURES 5 to 10 show diagrammatically the methods of further 
preferred embodiments of the present invention; 

FIGURE 11 shows a schemmatic diagram of an orthogonal time of 
flight mass spectrometer suitable for use in the present 
invention; 

FIGURE 12 to 14 show diagrammatically a method of an alternative 
embodiment of the present invention using reversably blocked 



WO 99/02728 



PCT/GB98/02048 



- 14 - 

nucleotide probes ; and 

FIGURE 15 shows the results of the method of Figures 12 to 14 . 
Examples 

Automated preparation of heterogenous template populations: 

In order to produce a high throughput DNA sequencing technology 
the automation of the production of the sequencing template is 
highly desirable. The following is an outline which describes an 
automated method of producing sequencing templates for using in 
the present invention. 

For a large scale sequencing project, for example a whole 
bacterial genome or a full YAC clone, the DNA must first be 
subcloned into a library. The process of producing a library of 
this sort can be done in-house or by commercially available 
services , such as that provided by Cl onetech. The D^TA is 
fragmented (e.g. by restriction enzyme digestion or sonif ication) 
to sizes in range of a few hundred bases and then subcloned into 
a cloning vector of choice. Because each fragment in the library 
is flanked by the same vector sequence a standard set of flanking 
PGR primers can be used to PCR amplify each each fragment. Using 
the same PCR primers for each fragment also helps normalise the 
efficiency of each PCR reaction as primer sequence is one of the 
most important factors affecting amplification efficiency, (see 
Figure 1) 

The library is then transfected into an appropriate bacterial 
strain and the bacteria plated out onto selective agar plates. 
Individual colonies (each containing an unique fragment contained 
within the cloning vector) are then picked by a colony picking 
robot (which are commercially available) . Each picked colony is 
then spiked into a unique PCR reaction, set up on a microtitre 
plate for example, and each fragment is PCR amplified using the 
standard primer set which flank the insert. One of the primers 
used in this reaction must be biotinylated which will allow the 
subsequent capture of the amplified fragment, (see Figure 2) 
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Following the PCR amplification, a known amount of each of the 
amplified fragments is then captured on a streptavidin coated 
surface by its biotinylated primer. By controlling the amount of 
available streptavidin a specific amount of PCR product can be 
captured. (This does, however, rely on all the primers being 
incorporated into the amplification products. This should only 
require a simple primer titration optimisation experiment as PCR 
reactions using clones are usually highly efficient.) 

Different protocols can be used for this purpose, for example 
streptavidin coated magnetic beads or streptavidin coated wells 
of a microtitre plate. When using beads, which will bind 1 pmol 
of biotin per ul of beads, adding 5ul of beads and the 
appropriate buffer to the PCR reaction will capture 5 pmol of the 
amplified fragment. The use of beads also allows the capture of 
different quantities of individual amplified fragments* By adding 
differing amounts of beads to separate amplification reactions 
prior to pooling them, one can, for example, create a 
heterogenous population with Ipmol of fragment 1, 4pmol of 
fragment 2, 10 pmol of fragment 3 and so on. Alternatively 
streptavidin coated wells of a microtitre plate could also be 
used by transferring each amplification reaction to a unique well 
of the microtitre plate. Commercially available streptavidin 
coated plates usually have a maximum binding capacity of between 
5 to 2 0pmol of biotin. Therefore, of the amount of amplified 
fragment captured to each well is determined by the binding 
capacity of that plate. 

Following capture, excess amplified fragments are then washed 
away, the double stranded PCR product is denatured with either 
alkali or heat (or both) (to free the non-biotinylated strand) . 
The non-biotinylated strand is then washed away and this leaves 
a single stranded template immobilised in the well or tube ready 
to be used in a sequencing reaction, (see Figure 3) 

DNA Sequencing Using Mass Labelled dLNTP's 

Figures 4a and 4b illustrate diagrammaticaly the method of DNA 
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sequencing using mass labelled dNTP's described below* 



An immobilised single stranded DNA template is prepared from a 
PCR product (see above) or by any other appropriate means. 
Immobilisation can be achieved by biotinylating one of the 
primers used in the PCR reaction which then allows the subsequent 
capture of the PCR product to either streptavidin coated beads 
or wells of a microtitre plate. Following capture, the non- 
immobilised strand is denatured by alkali or heat (or both) and 
washed away. The other (non-biotinylated) PCR primer is then 
annealed to its complementary site on the single stranded PCR 
product to act as a sequencing primer for a DNA polymerase* 

Extension of the sequencing primer is achieved by incubating the 
template with nucleotide triphosphates, mass labelled 
deoxynucleotides present at a low concentration and a DNA 
polymerase (e.g. taq, E. Coll , T7 DNA polymerases or their 
derivatives - acting in a template specific manner) , with 
appropriate conditions and time. Extension of the primer is 
blocked in individual templates with a known probability due to 
the presence of blocking groups on the mass labelled 
deoxynucleotides (this can be the unique 'mass label' itself) 
which prevents the 3 ' OH of the deoxynucleotides reacting with the 
5' triphosphates of further nucleotide, thus preventing the 
addition of any more bases by the polymerase. This is shown in 
step 1 of Figure 4a where the Sanger reaction is performed on 
each template separately using terminating nucleotides that carry 
mass labels . The unique mass labels are attached to the 
deoxynucleotides by a cleavable linker group. Cleavage is carried 
out here by laser light (or possibly by chemical or other means) . 
On cleavage, the mass label is released into solution for 
analysis . 

This embodiment is akin to traditional Sanger sequencing where 
the blocked nucleotides are present at a low concentration in the 
presence of ordinary triphosphates ♦ This allows one to generate 
a Sanger ladder of fragments. In this embodiment the efficiency 
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of photocleavage of the mass labels is not critical to the 
readlength achievable. One requires just a single sequencing 
reaction, which means that template is not left standing for long 
periods of time. Similarly, the potential for photodamage due to 
repeated photolytic reactions required in the iterative approach 
of patents GB 9620769.1 and GB9700760.1, is avoided. One can 
analyse the resultant sequence ladder by capillary 
electrophoresis followed by direct analysis of mass labels by 
electrospray mass spectrometry (ESMS) . 

In step 2 of Figure 4a, multiple templates are fed into the same 
capillary electrophoresis electrospray mass spectrometry system. 
As each band passes through, the mass labels are cleaved by 
photolysis and injected into an electrospray mass spectrometer 
for analysis. In a preferred arrangement, no photocleavage 
apparatus is present and, instead, the mass labels are designed 
so that, upon entry into the mass spectrometer, the conditions 
of ionisation are such that the mass labelled fragments cleave 
and the mass labels are then analysed. As shown in Figure 4b, 
the identity of mass labels determines the identity of the 
nucleotide and the source template from which the sequence is 
derived . 

The use of labelled nucleotides is the preferred format as this 
avoids certain problems associated with primer labelled 
sequencing, which is also possible with mass labelling. 
Polymerase reactions do often terminate prematurely, without the 
intervention of blocked nucleotides. This is a problem with 
primer labelled sequencing because the premature termination 
generates a background of labelled fragments that are terminated 
incorrectly. Labelling the blocking nucleotides ensures only 
correctly terminated fragments are labelled so only these are 
detected by the mass spectrometer. This then permits cycle 
sequencing where multiple rounds of primer are added to the 
template. The sequencing reaction is performed using a 
thermostable polymerase. After each reaction the mixture is heat 
denatured and more primer is allowed to anneal with the template. 
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The polymerase reaction is repeated when primer template 
complexes reform. Multiple repetition of this process gives a 
linear amplification of the signal, enhancing the reliability and 
quality of the sequence generated. 

This invention further avoids problems associated with 
fluorescence based methods. The large differences in sizes of 
the commercially available fluorescent labels causes differences 
in migration of templates of similar length. Since any set of 
4 labels used to identify a sequencing reaction can be chosen to 
be very close in size since a mass spectrometer will comfortably 
detect differences in mass of one dalton, which should have 
minimal effect on the relative migration of any given template. 

Multiplexing Sanger Ladders: 

Given a large number of mass labels one can multiplex a series 
of Sanger sequencing reactions by labelling the 4 blocking 
nucleotides with a different set of 4 mass labels in each 
reaction. Each sequencing reaction would be performed separately 
and then all the templates would be recombined at the end of the 
sequencing reactions. The Sanger ladders generated are then all 
separated together by a single capillary electrophoresis step 
feeding fragment bands directly into a mass spectrometer for 
analysis of the labels. Each set of 4 mass labels then correlates 
to a single source template. 

Simultaneous sequencing : 

This invention avoids the need for stringent quantitation. One 
can however exploit limited quantitation with this invention to 
compress the number of mass labels needed to sequence a number 
of templates in parallel. Consider the case where one has 16 
labels evenly spread across a clearly distinguishable mass range. 
One could assign the first four labels to the bases A, C, G, T 
in that order and use deoxynucleotides labelled in this way to 
sequence the first template. In a similar manner three other 
templates can be sequenced. Given that there is a roughly 1 in 
16 chance that the same base will appear at the same point in two 
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reactions one can use the same label in two separate reactions, 
labelling different bases if desired* If two reactions share all 
4 labels one can double the number of parallel reactions that can 
be analysed. If by chance two templates have the same base at a 
point, clearly only one label will appear for two templates so 
they clearly share a base. Limited quantitation would be needed 
to reassign the base calls for each band leaving the capillary 
electrophoresis system to its source template, but as long as the 
quantities of templates sharing a set of labels are distinct, 
this should be relatively simple to do. Furthermore the 

templates could be sequenced in a single reaction. More templates 
could be sequenced simultaneously with more stringent 
quantification of each template* One would need to know in 
advance how many templates one was sequencing in each reaction 
and how much of each was present at the start . This may preclude 
one from using cycle sequencing in which there is a linear 
amplification of template, if distortion of quantitation cannot 
be calibrated for. 

Separation Techniques : 

The separation of a Sanger Ladder by gel electrophoresis imposes 
limitations on the throughput and accuracy achievable for DNA 
sequencing. The polymerase reaction used to generate a Sanger 
ladder is simple and relatively fast and can readily be performed 
in parallel or even multiplexed in the same reaction. Various 
novel sequencing methods have been developed that are compatible 
with PCR and hence exploit automation using 96 well plate 
robotics and thermocyclers . 

Gel electrophoresis works on the simple principle that a charged 
molecule placed between two electrodes will migrate towards the 
electrode with the opposite charge to its own. The larger the 
molecule is for a given charge the more slowly it will migrate 
towards the relevant electrode. Nucleic acids are poly-ions, 
carrying approximately one charge per nucleotide in the molecule. 
This means that nucleic acids of any size migrate at 
approximately the same rate ignoring frictional forces from the 
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separation medium. The effect of frictional forces is related to 
the size of the molecule or in the case of nucleic acids, the 
length of the molecule. This means that nucleic acids are 
effectively separated by length. The role of the gel matrix is 
to provide frictional force to impede migration. The speed of 
separation is proportional to the size of the electric field 
between the two electrodes. This means that increasing the size 
of the electric field will reduce separation times, however the 
electrical resistance of the separation medium means that heat 
is generated as a result of the electric field and the heat 
increases with the electric field. The higher temperatures 
increase the kinetic energy imparted to the analyte leading to 
greater diffusion and band broadening. This reduces the 
resolution of the separation. Gels can be cooled but heat 
dissipation from a slab gel is limited by its surf ace /volume 
ratio which is essentially a function of the thickness of the 
gel. Thinner gels dissipate heat better but there is an 
additional effect of increased resistance. This .means that in 
slab gel techniques using gels of 200 to 400 (xm thickness heating 
becomes severe if the electric field strength is greater than 50 
V/cm. Replacement of the slab gel electrophoretic steps is the 
most attractive target in view to increasing the overall speed 
of DNA sequencing. 

Capillary electrophoresis offers significant advantages over gel 
electrophoresis as a separation technology. Various approaches 
to capillary electrophoresis exist but for nucleic acid 
separations capillary gel electrophoresis is often used. This 
technique is essentially gel electrophoresis in a narrow tube. 
The use of a capillary gives an improved surface/volume ratio 
which results in much better thermal dissipation properties. This 
allows much higher electric fields to be used to separate nucleic 
acids greatly increasing the speed of separations. Typically 
capillaries are 50 to 75 jim wide, 24 to 100 cm long and electric 
fields up to 400 v/cm can be used although lower fields are used 
routinely. Increased separation speeds also improve the 
resolution of the separation as there is less time for diffusion 
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effects to take place and so there is less band broadening. 
Improved resolution permits greater read lengths, increasing 
throughput further. The introduction of flowable polymers has 
meant that time consuming and technically demanding steps of gel 
preparation associated with slab gel electrophoresis can be 
avoided and capillaries can be prepared by injection of the 
sieving matrix. This improves the reproducibility of separations 
and the injection of polymers is a process which is readily 
automated. 

Multiplexing Sequencing Reactions: 

Primer labelled sequencing is also possible with mass labelling 
which has certain advantages over nucleotide labelled sequencing. 
Consider the situation where one has a number of templates each 
with distinct primer sequences* One can label each unique primer 
with a unique mass label . The template mixture can be divided 
into four reactions in which only one of each of the four 
terminating dideoxynucleotides is present. Each template is 
primed with its uniquely labelled primer. After performing each 
of the four Sanger reactions, one can resolve each ladder by 
capillary electrophoresis mass spectrometry. Each band that 
elutes from the capillary electrophoresis column that contains 
a terminated fragment can be related back to its source template 
by the label linked to its primer. In this way a large number of 
templates can be sequenced simultaneously in the same reaction. 

For each template with a unique primer sequence, one could choose 
to label the unique primer with a different label in each of the 
four reactions to identify which terminating nucleotide is 
present. This would allow one to pool the four individual base 
sequencing reactions and analyse them simultaneously. This has 
the advantage that all four reactions are analysed under 
identical conditions which should avoid ambiguities that might 
arise when analysing the four reactions separately due to 
variations in conditions in each analysis. 

The advantage of labelling nucleotides in order to detect only 
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correctly terminated nucleotides is lost by labelling the primer. 
This advantage could be conferred to primer labelling by 
modifying the terminating nucleotides to carry a marker to allow 
correctly terminated fragments to be retained for example by 
affinity to a column. A plausible modification would be addition 
of (biotimamido) pentylamine to the terminating nucleotides which 
would allow reversible binding to avidin. For normal sanger 
sequencing where only one template is analysed at a time this is 
probably not worthwhile whereas for reactions where many 
templates are sequenced simultaneously the additional cost of the 
separation would be tolerable for the improved quality of the 
sequence data that would be generated. 

Preparation of templates with unique primers : 

In order to permit simultaneous sequencing reactions with mass 
labels one requires that each template be identifiable with a 
uniquely labelled sequencing primer. One could conceivably 
engineer a family of cloning vectors that bear different primer 
sequences flanking the integration site for the exogenous DNA to 
be sequenced. Each sequencing reaction would be performed on a 
group of templates where only one template derived from each 
vector type is present so that all the templates in a reaction 
bear unique primers. Further details of these methods are 
described in copending International Patent Application filed on 
13th July 1998 by the present applicants [PWF Ref : 87847] 

Adaptors to introduce vrimers to restriction fragments: 
One can, however, exploit the ability to sequence numerous 
templates simultaneously to cut out sub-cloning steps in a 
sequencing project. Consider a large DNA fragment such as a 
mitochondrial genome or a cosmid. One can cleave such a large 
molecule with a frequently cutting restriction enzyme to generate 
fragments of the order of a few hundred bases in length. If one 
uses a restriction endonuclease like Sau3Al one is left with 
fragments with a known sticky end to which one can ligate 
adaptors bearing a known primer sequence . 
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This approach is shown diagrammatically in Figure 5. In step 1, 
a genomic DNA clone is treated with a frequent cutting 
restriction endonuclease such as Sau3Al . In step 2, adapters are 
ligated to the restriction fragments bearing specific primer 
sequences. All fragments are dealt with simultaneously although, 
for simplicity, only one is shown in the Figure. In step 3, the 
DNA is denatured after an optional amplification step and 
optional cleaning up steps to remove unligated adapters and 
restriction enzymes. Figure 6 shows a continuation of this 
process in which the single-stranded DNA is captured using an 
immobilised primer complementary to the adapter primer sequence. 
In step 5, the immobilised primer and appropriate polymerase is 
used to generate a complementary strand. In step 6, the free 
strand is melted off and washed away or recaptured onto 
immobilised primer to generate further copies of template, if 
desired. In Figure 7, mass labelled primers are added at step 
7. The mass label identifies bases overlapping into unknown 
sequence. By step 8, the majority of DNA molecules in a small 
population should be uniquely primed in this way and a primed 
population can thus be used for extension in a Sanger sequencing 
or cycle sequencing reaction. 

Similar steps are shown in Figures 8 to 10 . 

The majority of properly restricted fragments should as a result 
bear an adaptor at each of their termini. This permits 
amplification of the adaptored restriction fragments at this 
stage if that is desired. After adaptoring and any amplification, 
one denatures the adaptored fragments and hybridises these 
fragments to a 'capture' primer. The capture primer could be 
biotinylated and presented to the adaptored fragments free in 
solution, after which captured fragments can be immobilised onto 
a solid phase support derivitised with avidin. Alternatively the 
primer could be immobilised onto a solid phase support prior to 
exposure to the adaptored restriction fragments. At this stage 
one would divide one's template into four separate pools in order 
to. sequence each pool with a different terminating nucleotide. 
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The captured fragments are made double stranded at this stage by- 
reaction with a polymerase . This means that immobilised copies 
of all sequences should be present . The hybridised strand can be 
melted off at this stage and be disposed of if that is desired. 
An immobilised complementary strand is retained. One can also 
amplify the sequence present at this stage by further 
hybridisation with capture primer. 

After denaturing free DNA from the immobilised copies of the 
template and disposing of free DNA, one can add a series of 
'sequencing' primers to the reaction. These primers bear the 
primer sequence in the adaptor and the restriction site by which 
the adaptors were originally ligated to the DNA and an additional 
overlap of a predetermined number of bases. If one has 64 labels 
available the overlap can be 3 bases. Each of the possible 3 base 
overlaps can be identified by a unique mass label. Given a 
population of the order of 5 0 to 6 0 templates one would expect 
the majority to have a different 3-mer adjacent to the ligated 
primer. Thus the majority of templates will be expected to 
hybridise to a distinct primer. Any template that bears a 3-mer 
immediately adjacent to the adaptor that is the same as that on 
another template would only be resolvable if one is able to 
determine by the quantity of each template which template to 
assign a base call to. 

With the majority of templates primed with a unique primer one 
can add polymerase, nucleotide triphosphates and one of the four 
blocking nucleotides to each reaction and can generate Sanger 
ladders (Figure 10, step 8) * If a thermostable polymerase is 
used, then the ladders can be denatured at the end of each cycle 
and fresh primers can be added. If cycle sequencing is used then 
one would almost certainly want some means to select for properly 
terminated fragments since cycle sequencing not only amplifies 
the number of properly terminated fragments but also the number 
of improperly terminated fragments. 



The sanger ladders from each of the four sequencing reactions are 
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then preferrably pooled (Figure 10, step 9) and analysed together 
by capillary electrophoresis mass spectrometry so as to avoid any 
ambiguities in assigning bases due to experimental differences. 
Each pool of templates would thus have to have its primers 
labelled with a unique set of mass labels. Thus a total of 256 
mass labels would be required. Each primer thus has four labels, 
one for each terminator reaction. The labels assigned to each 
primer should be close in mass and size to minimise differences 
in migration between each termination reaction. 

Multiplexing with nucleotide labelled reactions: 

A further embodiment of this invention is multiplexing multiple 
templates in reactions with labelled nucleotides. 

Consider a reaction in which unmodified ATP, CTP, GTP and TTP are 
present with the four corresponding uniquely mass labelled 
terminating nucleotides. One can generate Sanger ladders for a 
number of templates simultaneously in the same reaction vessel. 
If these different templates share a sequencing primer, they can 
be subsequently sorted into separate groups prior to separation 
on the basis of the sequence immediately adjacent to the primer. 
One could separate the fragments onto a hybridisation array where 
the array bears a sequence complementary to the sequencing primer 
at all points and an additional predetermined number of bases, 
N, such that each location on the array bears just one of the 
possible N base sequences. This means if N is 4 there would .be 
25 6 discrete locations on the array. It is expected that a group 
of templates would in most cases have distinct sequences 
immediately adjacent to the primer. 

This would be an expensive exercise for sorting templates from 
just one reaction vessel. With a large number of mass labels, 
however, one can have distinct sets of 4 mass labels identifying 
blocking nucleotides in a large number of reactions. Thus 
multiple templates can be added to different reaction vessels, 
preferably different templates to each reaction vessel . After 
generating Sanger ladders in each vessel, the reactions can be 
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pooled and the templates from each reaction can be sorted 
simultaneously. One would expect the majority of ladders of each 
template from each reaction to segregate to discrete locations 
on an array and that each location on the array would receive 
template ladders from a number of distinct reactions. 

Having sorted ladders to discrete locations on an array one needs 
to separate the ladders from each location and identify the mass 
labels that terminate each set of fragments of each length. How 
one does this would depend on the array used. 

Practically speaking, a hybridisation array could comprise an 
array of wells on microtitre plates, for example, such that each 
well contains a single immobilised oligonucleotide that is a 
member of the array. In this situation a sample of the pooled 
reactions is added to each well and allowed to hybridise to the 
immobilised oligonucleotide present in the well. After a 
predetermined time the unhybridised DNA is washed away. The 
hybridised DNA can then be melted off the capture oligonucleotide 
and loaded into a capillary electrophoresis mass spectrometer. 

Equally the array could be synthesised combinatorially on a glass 
"chip" according to the methodology of Southern or that of 
Affymetrix. One could hybridise the pooled Sanger ladders to the 
chip and wash away unhybridised material. If the probes of the 
array are immobilised with a linker containing a photocleavable 
group, ladders from discrete locations on the array could be 
released into solution by application of laser light to the 
desired location on the array. The solution phase ladders can 
again be simply loaded into a capillary electrophoresis mass 
spectrometer . 

Again, the advantage of multiplexing and sorting templates is the 
ability to avoid a number of subcloning steps in a large scale 
sequencing project. One would prepare template as described 
above for primer labelled multiplexing but at the stage when 
sequencing primer is added, the primers used would not be mass 
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labelled. The blocking nucleotides in each reaction would be 
labelled instead. 

Features of Mass labels: 

To acheive the required behaviour from a mass label, certain 
chemical properties are desirable- These are represented in 
particular molecular groups or moieties that can be incorporated 
into mass labels in a number of ways. 

For the purposes of generating mass labels, favoured labels 
require a cleavable bond in the linker and fragmentation 
resistant bonds in the mass label. 



For optimal performance using present techniques a mass/charge 
ratio of up to 2000 to 3000 units is the optimal range for such 
labels as this corresponds to the range over which singly charged 
entities can be reliably detected with greatest sensitivity, 
however labels of mass less than 100 to 2 00 daltons are not ideal 
as the low mass end of the spectrum tends to be populated by 
solvent molecules, small molecule impurities, multiple ionisation 
peaks and fragmentation peaks. 

To permit detection one requires labels that have a net charge, 
but are preferrably not multiply ionisable, i.e. they have a 
fixed single charge. Furthermore they should should be resistant 
to fragmentation. This ensures that each peak in the mass/charge 
spectrum corresponds to a single label and simplifies the 
analysis of the data. Furthermore this reduces any ambiguity in 
the determination of the quantity of the label, which is very 
important for some of the applications for which this invention 
has been developed. 




Label 
A 



Easily Cleaved 



Resistant to 
fragmentation 
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Various functionalities exist which carry or can carry positive 
charges for positive ion mass spectrometry. These include but are 
not limited to amines particularly tertiary amines and quaternary 
amines. Quaternary ammonium groups carry a single positive charge 
and do not require ionisation. For positive ion spectrometry 
these allow great sensitivity. Hence preferred positive ion mass 
labels should carry at least one such group. 

Various functionalities are available to carry a negative charge 
for negative ion mass spectrometry which include but are not 
limited to carboxylic acids, phosphonates , phosphates , phenolic 
hydroxyls, sulphonic acids, sulphonilamides , sulphonyl urea, 
tetrazole and perfluoro alcohol. 

Ionisation techniques : 

For many biological mass spectrometry applications so called 
'soft' ionisation techniques are used* These allow large 
molecules such as proteins and nucleic acids to be put into the 
mass spectrometer in solutions with mild pH and at low 
concentrations. Two such techniques are ideal for use with this 
invention; electrospray ionisation and Matrix Assisted Laser 
Desorption Ionisation (MALDI) . 

Electrospray Ionisation: 

Electrospray ionisation requires that the dilute solution of 
biomolecule be 'atomised' into the spectrometer, i.e. in a fine 
spray. The solution is, for example, sprayed from the tip of a 
needle across an electrostatic field gradient or into a stream 
of dry nitrogen in an electrostatic field. The mechanism of 
ionisation is not fully understood but is thought to work broadly 
as follows. In a stream of nitrogen the solvent is evaporated. 
With a small droplet, this results in concentration of the 
biomolecule. Given that most biomolecules have a net charge this 
increases the electrostatic repulsion of the dissolved protein. 
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As evaporation continues this repulsion ultimately becomes 
greater than the surface tension of the droplet and the droplet 
'explodes' into smaller droplets. The electrostatic field helps 
to further overcome the surface tension of the droplets. The 
evaporation continues from the smaller droplets which, in turn, 
explode iteratively until essentially the biomolecules are in the 
vapour phase, as is all the solvent . This technique is of 
particular importance in the use of mass labels in that the 
technique imparts a relatively small amount of energy to ions in 
the ionisation process and the energy distribution within a 
population tends to fall in a narrower range when compared with 
other techniques . The ions are accelerated out of the ionisation 
chamber through a pair of electrodes. The potential difference 
across these electrodes determines whether positive or negative 
ions pass into the mass analyser and also the energy with which 
these ions enter the mass spectrometer. This is of significance 
when considering fragmentation of ions in the mass spectrometer. 
The more energy imparted to a population of ions the more likely 
it is that fragmentation will occur. By adjusting the 
accelerating voltage used to accelerate ions from the ionisation 
chamber one can control the fragmentation of ions. 

Matrix Assisted Laser Desorption Ionisation (MALDI) : 

MALDI requires that the biomolecule solution be embedded in a 
large molar excess of an photo-excitable 'matrix'. The 
application of laser light of the appropriate frequency ( 2 66 nm 
beam for nicotinic acid ) results in the excitation of the matrix 
which in turn leads to excitation and ionisation of the embedded 
biomolecule. This technique imparts a significant quantity of 
translational energy to ions, but tends not to induce excessive 
fragmentation despite this. Accelerating voltages can again be 
used to control fragmentation with this technique though. 

MALDI techniques can be supported in two ways. One can embed mass 
labelled DNA in a MALDI matrix, where the labels themselves are 
not specifically excitable by laser or one can construct labels 
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that contain the necessary groups to allow laser energisation. 
The latter approach means the labels do not need to be embedded 
in a matrix before performing mass spectrometry. Such groups 
include nicotinic, sinapinic or cinnamic acid moieties. MALtDI 
based cleavage of labels would probably be most effective with 
a photocleavable linker as this would avoid a cleavage step prior 
to performing MALDI mass spectrometry. The various excitable 
ionisation agents have different excitation frequencies so that 
a different frequency can be chosen to trigger ionisation from 
that used to cleave the photolysable linker. These excitable 
moieties are easily derivitised using standard synthetic 
techniques in organic chemistry so labels with multiple masses 
can be constructed in a combinatorial manner. 



Fragmentation within the Mass Spectrometer; 

Fragmentation is a highly significant feature of mass 
spectrometry. With respect to this invention it is important to 
consider how one intends to identify a mass label. At the two 
extremes one can either design molecules that are highly 
resistant to fragmentation and identify a label by the appearance 
of the label's molecular ion in the mass spectrum. One would thus 
design families of labels to have unique molecular ions. At the 
other extreme one can design a molecule with a highly 
characteristic fragmentation pattern that would identify it* In 
this case one must design families of labels with non-overlapping 
patterns or with at least one unique fragmentation species for 
each label by which to identify each label unambiguously. 
Fragmentation is, however, a property of the molecule and of the 
ionisation technique used to generate the ion. Different 
techniques impart differing amounts of energy to the ion and the 
chemical environment of the ions will vary considerably, thus 
labels that are appropriate for one mass spectrometry technique 
may be inappropriate in others. The preferred approach is to 
design fragmentation resistant molecules, although some 
fragmentation is inevitable. This means one aims to identify 
molecules with a single major species, either the molecular ion 
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Determining bond stability in the mass spectrometer: 

In neutral molecules it is reasonably simple to determine whether 
a molecule is resistant to fragmentation, by consideration of 
bond strengths. However, when the molecule is ionised, the bond 
strength may increase or decrease in ways that are difficult to 
determine a priori. For example, given a bond, X-Y, we can write: 

In the equations above, D(A-B) refers to bond energy of the 
species in parentheses, I (N) refers to the ionisation energy of 
the species in parentheses and delta-H is the free energy of 
formation of the species in parentheses. The upshot of the 
equations above is that in order to predict whether a bond is 
likely to be stable under a given set of ionisation conditions 
we need to know the energy of ionisation of the molecule and the 
energy of ionisation of the neutral fragment that results from 
fragmentation at the bond in question. 

For example, consider the ON bond in aniline: 

I(NH 2 ) =11.14 electronvolts (ev)md I (C^NH,) =Uev 
/, I(C 6 H 5 NH 2 ) <I(NH 2 *) by 3.44ev 

The alternative cleavage at this bond is: 

J(C 6 H/) ==9 - 35 electronvolts (ev) and /(Q/^M^) ^7. lev 
,', I(C 6 H 5 NH 2 )<I(C 6 H 5 -) 

This bond is thus not easily broken in the ion. Aniline, if it 
has sufficient initial energy to fragment, is generally observed 
to cleave releasing HCN, rather than by cleavage of a C-N bond. 
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I(OH') =\3ev and I(C 6 H,OH) =8.47ev 
I(C 6 H 5 OH) <I(OH') 

Similarly considerations apply to phenol : 

The alternative cleavage at this bond is 

J(C 6 H 5 *) =9. 35ev and I{C 6 H 5 OH) =8.47ev 
/. I(C 6 H s OH) <I(C 6 H 5 ') 

Thus C-0 cleavage is not observed. 

Determining the differences in ionisation energies of molecule 
and neutral fragments is a general working principle which can 
be used to predict likely ion bond strengths . If the energy added 
during ionisation is less than the ionic bond strength then 
ionisation will not be observed. Typical ionic bonds that have 
good strength include, aryl-O, aryl-N, aryl-S bonds. Generally, 
aliphatic type bonds become less stable in ionic form. Thus 
single C-C bonds are weak in the ion but C=C is still strong* 
Aryl~C=C tends to be strong too for the same reasons as aryl-O, 
etc. Aryl or Aryl-F bonds are also strong in ionic form which is 
appealing as f luorocarbons are cheap to manufacture and are 
chemically inert . 

Similar considerations apply to negative ions, except one must 
use electron affinities in the equations above rather than 
ionisation energies . 

Linkers to allow controlled release of mass labels: 

Controllable release of mass -labels from their relevant molecule 
can be be achieved in response to light or chemical triggers or 
can be achieved within the mass spectrometer through control of 
fragmentation. Photo- cleavable and chemically cleavable linkers 
can be easily developed for the applications described * Many are 
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commercially available. 

Cleavage of mass labels within the mass spectrometer: 

An alternative to chemical or photolytic cleavage is the use of 
the ionisation process to induce fragmentation of labels. One can 
design a linker that is highly labile in the ionisation process 
such that it will cleave when the molecule to which it is 
attached is ionised in a mass spectrometer. There are two factors 
to consider in controlling cleavage: 1) how much excess of energy 
is deposited on the ion during the ionisation process and 2) 
whether this excess is sufficient to overcome bond energy in the 
ion. The excess of energy deposited is strongly determined by the 
ionisation technique used. The bond energy is obviously 
determined by the chemical structure of the molecule being 
analysed. 

Fragmentary linkers: 

As discussed above certain groups are particularly resistant to 
fragmentation, while others such as aliphatic type bonds are 
reasonably susceptible to cleavage. In order to design a linker 
that cleaves in a specified location, one might design a molecule 
that is broadly resistant to fragmentation but that contains a 
'weak link', whose fragment ion is stabilised by the surrounding 
molecule. Certain structural features are observed to stabilise 
fragments ions when cleavage occurs at certain bonds in a 
molecule. Linear alkanes are seen to fragment relatively randomly 
while molecules containing secondary and tertiary alkyl groups 
are seen to fragment most commonly at the branch points of the 
molecule due to the increased stability of the secondary and 
tertiary carbocations. Similarly double bonds stabilise adjacent 
carbocations through resonance or delocalisation effects. Similar 
effects are noted in bonds adjacent to aryl-C- groups. 

For the purposes of generating a linker for mass labels, one 
requires a single mass spectrometrically weak bond in the linker 
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and strong ones in the mass label of the sorts described above 
A typical weak bond would be: 

o 

II 
c 



Meta.sta.ble ions and autoclea.va.cre of mass labels: 

Stability is in mass spectrometry is dependant on the ionisation 
technique used. Broadly speaking an unstable ion is a species 
that will fragment in the ionisation chamber. Similarly a stable 
ion will not fragment before reaching the detector. Metastable 
ions are thus ions that fragment somewhere between the ion source 
and detector. The temporal distinctions between the classes is 
somewhat dependant on the configuration of the mass spectrometer 
used. As mentioned before ionisation techniques have a 
considerable influence on the degree of fragmentation induced in 
a population of molecules so what may be a stable ion in 
electrospray ionisation may be highly unstable under electron 
ionisation. Furthermore the geometry of the separation stages of 
the mass spectrometer determines how long an ion exists before 
detection - ion trap mass spectrometers can obviously store ions 
for considerable periods of time before detection so molecules 
which in other geometries reach a detector in high abundance will 
have time in a trap to fragment. 



These factors all have a bearing on the nature of molecule used 
as a mass label and as a linker. Clearly the design envelope 
available for such molecules is fairly large. 



Induced clea.va.ge of labels: 

Various analytical techniques have been developed over the years 
to promote fragmentation of ions for use in structural studies 
and for unambiguous identification molecules on the basis of 
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fragmentation fingerprints. Most of the ionisation techniques 
will cause some degree of fragmentation but variations on 
chemical ionisation techniques can most simply be used to aid 
fragmentation. Electrospray ionisation can be modified slightly 
to promote fragmentation. The ionisation chamber can be modified 
to include a discharge electrode which can be used to ionise the 
bath gas which in turn will collide with the vaporised sample 
molecules increasing ionisation and fragmentation of the sample. 
This technique is termed Atmospheric Pressure Chemical Ionisation 
(APCI) . 

A more active approach to fragmentation entails inducing 
decomposition of molecules such as collision induced 
decomposition (CID). CID uses tandem mass analyser/spectrometer 
constructions to separate a stream of ions, then induce collision 
of the separated ion stream to promote fragmentation followed by 
analysis of the resultant ions by a second mass spectrometer. A 
typical tandem mass geometry comprises two quadrupole mass 
analysers separated by a collision chamber. This is just a 
chamber between the two quadrupoles into which a gas can be 
introduced to allow collision with the ion stream from the first 
mass analyser. The gas density in the collision chamber must not 
be too high to permit the collision fragments to pass through, 
for subsequent separation and analysis by the second quadrupole* 
Mass labelled molecules could be separated in tandem mass 
spectrometer so that the first quadrupole separates molecules 
into streams of a given mass/charge ratio followed by collision 
which would favour cleavage of labels which can then be analysed 
in a second mass analyser. 

Other techniques are compatible with mass label technologies. A 
preferred method as discussed earlier is photon induced 
decomposition. Photon induced decomposition would involve the use 
of photocleavable mass labels. A typical geometry uses a tandem 
mass analyser configuration similar to those used in CID, but the 
collision cell is replaced with a photo-excitation chamber in 
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which the ion stream leaving the first quandrupole is subjected 
to laser light. High intensity lasers are required to ensure that 
a significant proportion of a fast moving ion stream interacts 
with a photon appropriately to induce cleavage. The positioning 
of the laser is extremely important to ensure exposure of the 
stream for a significant period of time. Tuning the laser to a 
specific frequency allows for precise control over the bonds that 
are induced to cleave. Thus mass labels linked with an 
appropriate photocleavable linker to their probes can be readily 
cleaved within the mass spectrometer. The photocleavage stage 
does not require a tandem geometry, the photocleavage chamber 
could be within or immediately following the ion source. 

A further technique is surface induced decomposition . Surf ace 
induced decomposition is another tandem analyser technique that 
involves generating an ion stream which is passed through the 
first analyser. This stream is then collided with a solid surface 
at a glancing angle. The collision fragments can then be analysed 
by a second separator and detector configuration. 

Induced cleavage can be performed in geometries other than tandem 
analysers. Ion traps mass spectrometers can promote fragmentation 
through introduction of a gas into the trap itself with which 
trapped ions will collide. Ion traps generally contain a bath 
gas, such as helium but addition of neon for example, promotes 
fragmentation. Similarly photon induced fragmentation could be 
applied to trapped ions. 



Mass Label Chemistries: 

For any practically or commercially useful system it is important 
that construction of labels be as simple as possible using as few 
reagents and processing steps as possible. A combinatorial 
approach in a which a series of monomeric molecular units are 
available to be used in multiple cominations with each other 



WO 99/02728 

would be ideal • 



- 37 - 



PCT/GB98/02048 



One can synthesise mass labels using standard organic chemistry 
techniques. Such labels ought to carry a single charge bearing 
group and should be resistant to fragmentation in the mass 
spectrometry technique used. Amine derivatives, quaternary 
ammonium ions or positive sulphur centres are good charge 
carriers if positive ions mass spectrometry is used. These have 
extremely good detection properties that generate clean sharp 
signals. Similarly, negatively charged ions can be used, so 
molecules with carboxylic acid, sulphonic acid and other moieties 
are appropriate for negative ion spectrometry. Labels for MALDI 
mass spectrometry can be generated by derivitising known 
molecules that are excitable by UV laser light, such as 
sinapinnic acid or cinnamic acid, of which a number of 
derivatives are already commercially available* Fragmentation 
resistant groups are discussed above* For a text on organic 
chemistry see: 

° Vogel's "Textbook of Organic Chemistry' 7 4th Edition, Revised 
by B.S. Furniss, A.J. Hannaford, V. Rogers, P.W.G. Smith & A,R, 
Tatchell, Longman, 1978. 

Amino acids : 

With a small number of amino acids such as glycine, alanine and 
leucine, a large number of small peptides with different masses 
can be generated using standard peptide synthesis techniques well 
known in the art. With more amino acids many more labels can be- 
synthesised. 

o E. Atherton and R.C. Sheppard, editors, 'Solid Phase Peptide 
Synthesis: A Practical Approach', IRL Press, Oxford, 



Carbohydra tes 
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Similarly carbohydrate molecules are useful monomeric units that 
can be synthesised into heteropolymers of differing masses but 
these are not especially amenable to ESMS. 

o Gait, M.J. editor, * Oligonucleotide Synthesis: A Practical 
Approach' , IRL Press, Oxford, 1990 

o Eckstein, editor, 'Oligonucleotides and Analogues: A Practical 
Approach', IRL Press, Oxford, 1991 

Other labelling chemistries : 

Clearly almost any molecule can be tacked onto another as a 
label. Obviously the properties of such labels in the mass 
spectrometer will vary. In terms of analysing biomolecules it 
will be important that the labels be inert, bear a single charge 
and be resistant to fragmentation. 

Linkers for cleavage within the mass spectrometer 

Compounds having the formula N-L-M are useful in the present 
invention where cleavage in the mass spectrometer is desired. 
N comprises one or more nucleic acid bases and would constitute 
the nucleotide or oligonucleotide probe or primer. L comprises 
a linker moiety and M comprises a mass marker optionally having 
a metal ion-binding moiety. The metal ion-binding moiety is a 
porphyrin, a crown ether, hexahistidine, or a multi -dentate 
ligand, preferably a bi-dentate ligand or EDTA. The metal ion- 
binding moiety may be bound to a monovalent, divalent or 
trivalent metal ion such as a transition metal ion or a metal ion 
of Group IA, IIA or IIIA of the periodic table. Preferably, the 
metal ion is Ni 2+ , Li + , Na\ K*, Mg 2+ , Ca 2+ , Sr 2 *, Ba 2 \ or Al 3+ . 

The mass marker may comprise a substituted or unsubstituted 
polyether which may be a substituted or unsubstituted poly{aryl 
ether) . The polyether may comprise one or more fluorine atom 
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substituents . 

Lisa group having the formula -R^Si-R^ in which R x and R 2 are 
substituents selected such that when the compound reacts with an 
electron donating moiety, either N or M cleaves from the Si atom 



in preference to R x and R 2 . R x and R 2 are selected such that each 
has a bond energy to Si greater than the bond energy of N and/or 
M to Si to ensure that when the compound reacts with an electron 
donating moiety either N or M cleaves from the Si atom in 
preference to R x and R 2/ and/or R x and R 2 are selected such that 
their steric bulk is sufficient to ensure that when the compound 
reacts with an electron donating moiety either N or M cleaves 
from the Si atom in preference to R x and R 2 . 

More preferably, R^ and R 2 are independently a hydrogen atom, a 
halogen atom, a substituted or unsubstituted alkyl group or a 
substituted or unsubstituted aryl group. More preferably, R x and 
R 2 are each independently fluorine, chlorine, bromine, iodine, 
methyl, ethyl , propyl, isopropyl, butyl, isobutyl, tert -butyl or 
phenyl groups. The electron-donating moiety may be a lewis base 
such as ammonia, a primary secondary or tertiary amine, a 
compound containing a hydroxy group, an ether or water. 

In one example, Lisa group having the formula 



R 
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in which R is an electron-withdrawing substituent such as a 
hydrogen atom, halogen atom, or a substituent comprising a 
fluorine atom, a chlorine atom, a bromine atom, an iodine atom, 
a trif luoroacetyl group, or a trif luoromethyl acetate group . L 
may be attached to N and/or M by a -CO-NH- group, an -NH-CO-NH- 
group, an -NH-CS-NH- group, a -CH 2 -NH- group, an ~S0 2 -NH- group, 
an -NH-CH 2 -CH 2 - group, or an -OP (=0) (0)O- group. 

Quantification and mass spectrometry: 

For the most part biochemical and molecular biological assays are 
quantitative. The mass spectrometer is not a simple device for 
quantification but use of appropriate instrumentation can lead 
to great sensitivity. It must always be remembered that the ion 
count is not a direct measure of the source molecule quantity, 
the relationship is a complex function of the molecule's 
ionisation behaviour. Quantitation is effected by scanning the 
mass spectrum and counting ions at each mass/charge ratio 
scanned. The count is integrated to give the total count at each 
point in the spectrum over a given time. These counts can be 
related back to the original qunatities of source molecules in 
a sample. Methods for relating the ion count or current back to 
the quantity of source molecule vary. External standards are one 
approach in which the behaviour of the sample molecules is 
determined prior to measurement of unknown sample. A calibration 
curve for each sample molecule can be determined by measuring the 
ion current for serial dilutions of a sample molecule when fed 
into the instrument configuration being used. 

Internal standards are probably the more favoured approach rather 
than external standards, since an internal standard is subjected 
to the same experimental conditions as the sample so any 
experimental vagaries will affect both internal control and 
sample molecule. To determine the quantity of a sample molecule, 
an internal standard of a known quantity is added to the sample. 
The internal standard is chosen to have a similar ionisation 
behaviour as the molecule being measured. Thus the ratio of 
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sample ion count to standard ion count can be used to determine 
the quantity of sample as the ratio of qunatities should be the 
same. Choosing appropriate standards is the main difficulty with 
this approach. One must find a molecule that is similar but not 
identical in its mass spectrum. A favourable approach is to 
synthesise the sample molecule with appropriate isotopes to give 
a slightly different mass spectrum, for a molecule with the same 
chemical behaviour. This approach might be less desirable than 
external standards for use with large numbers of mass labels due 
to the added expense of finding or synthesising appropriate 
internal standards but will give better qunatif ication than 
external standards. An alternative to isotope labelling is to 
identify a molecule that has similar but not identical chemical 
behaviour as the sample in the mass spectrometer. Finding such 
analogues is difficult and is a significant task for large 
families of mass labels. 

A compromise approach might be appropriate though, since large 
families of mass labels will ideally be synthesised 
combinatorially , and will thus be related chemically. A small 
number of internal controls might be used, where each individual 
control determines the quantities of a number of mass labels. The 
precise relationship between internal standard and each mass 
label might be determined in external calibration experiments to 
compensate for any differences between them. 

The configuration of the instrument is critical to determining 
the actual ion count itself, particularly the ionisation method 
and the separation method used. Certain separation methods act 
as mass filters like the quadrupole which only permits ions with 
a particular mass charge ratio to pass through at one time. This 
means that a considerable proportion of sample never reaches the 
detector. Furthermore most mass spectrometers only detect one 
part of the mass spectrum at a time. Given that a large 
proportion of the mass spectrum may be empty or irrelevant but 
is usually scanned anyway, this means a further large proportion 
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of the sample is wasted- These factors may be a problem in 
detecting very low abundance ions but these problems can in large 
part be overcome by correct configuration of the instrumentation. 



To ensure better quantification one could attempt to ensure all 
ions that are generated are detected. Mattauch-Herzog geometry 
sector instruments permit this but have a number of limitations. 
Sector instruments are organised into distinct regions, 
'sectors', that perform certain functions. In general the 
ionisation chamber feeds into a free sector which feeds into an 
'electric sector' . The electric sector esentially 'focusses' the 
ion beam which is divergent after leaving the ion source. The 
electronic sector also ensures the ion stream has the same 
energy. This step results in the loss of a certain amount of 
sample. This focussed ion beam then passes through a second free 
area into a magnetic sector which splits the beam on the basis 
of its mass charge ratio. The magnetic sector behaves almost like 
a prism. A photographic plate can be placed in front of the split 
beam to measure the intensities of the spectrum at all positions. 
Unfortunately there is a limit on the dynamic range of these 
sorts of detector and it is messy and cumbersome- Better dynamic 
range is achievable with electron multiplier arrays, but at a 
cost of loss in resolution which is limited by how close together 
the elements of the array can be constructed. With a family of 
well characterised mass labels one would probably monitor only 
sufficient peaks to sample all the mass labels unambiguously. 
In general array detectors would allow one to simultaneously and 
continuously monitor a number of regions of the mass spectrum 
simultaneously, which might be applicable for use with well 
characterised mass label families. The limit on the resolution 
of closely spaced regions of the spectrum might restrict the 
number of labels one might use, though, if array detectors are 
chosen. For ' selected ion monitoring' (SIM) the quadropole has 
an advantage over many configurations in that the fields that 
filter ions can be changed with extreme rapidity allowing a very 
high sampling rate over a small number of peaks of interest. 
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Orthogonal TOF mass spectrometry: 

An approach that is preferable to array geometries is the 
orthogonal time of flight mass spectrometer (see Figure 11) . This 
geometry that allows for very fast sampling of an ion stream 
followed by almost instantaneous detection of all ion species . 
The ion current leaving the source, probably an electrospray 
source for many biological applications, passes an electrode 
plate perpendicular to the current. This plate is essentially an 
electrical gate and is used to generate a repulsive potential 
which deflects the ion current 7 orthogonally' into a time of 
flight mass analyser that uses a reflectron. The reflectron is 
essentially a series of circular electrodes that generate an 
increasingly repulsive electromagnetic fieldthat normalises ion 
energies and reflects the ion stream into a detector. The 
reflectron is a simple device that greatly increases the 
resolution of TOF analysers. Ions leaving the ion source will 
have different energies, faster ions will penetrate the repulsive 
field further than ions with a lower energy and so will be 
delayed slightly with respect to the lower energy ions but since 
they will arrive slightly before the lower energy ions they will 
enter the TOF at roughly the same time so all the ions of a given 
mass charge ratio will arrive at the detector at roughly the same 
time. When the electrical gate is 'closed' to deflect ions into 
the TOF analyser, the timer is triggered. The flight time of the 
deflected ions is recorded and this is sufficient to determine 
their mass/charge ratio. The gate generally only sends a short 
pulse of ions into the TOF analyser at any one time. Since the 
arrival of all ions is recorded and since the TOF separation is 
extremely fast, the entire mass spectrum is measured effectively 
simultaneously. Furthermore, the gate electrode can sample the 
ion stream at extremely high frequencies so very little sample 
is required. For these reasons this geometry is extremely 
sensitve, to the order of a few f emtomoles . 
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Primer Extension and Parallel Sequencing of Subsets of Nucleic 
Acid Fragments: 

Sequencing a single molecule by ligation of single stranded 

oligonucleotides to a Sanger-like sequence ladder: 

Referring to Figure 12, consider a population of copies of a 
single nucleic acid, immobilised at one terminus to a fixed 
insoluble matrix. This molecule is rendered single stranded, 
except for a short stretch of double -stranded DNA at the 
immobilised terminus of the molecule to serve as a primer for the 
DNA polymerase reaction. This primer sequence could be provided 
by the adaptor used to immobilise the terminus or could be a PGR 
primer if the template is an amplified product . Thus far the 
technology is as taught by (S. Stahl, T. Hultman, A. Olsson, T. 
Moks, M. Uhlen, 1988, "Solid Phase DNA sequencing using the 
biotin-avidin system". Nucleic Acids Research. 16, 3025-3038) for 
solid phase Sanger synthesis. An important feature of this 
approach is in the blocking nucleotides, which would preferably 
not be dideoxynucleotides , but rather would be reversibly 
blocked. This means that after allowing the template to be 
polymerised in the presence of chain terminating nucleotides, one 
is left with the expected Sanger ladder of fragments. One can 
wash away the polymerisation reagents leaving the immobilised 
Sanger fragments. At this stage the blocking groups are removed 
to expose a 3' hydroxyl which is amenable to further extension. 
In the simplest case each terminating base is labelled with a 
mass label that identifies it uniquely which can be analysed by 
mass spectrometry as described above, however this does not allow 
one to really exploit the possiblity of analysing a number of 
heterogenous sequence templates simultaneously. Thus one would 
prefer to use a removable blocking group whose identity is not 
determined and which exposes the 3 ' -OH of the blocked sequence 
for further extension. 

To these exposed termini one can then ligate oligonucleotides of 
a pre -determined length (N) each bearing a photocleavable mass 
label that identifies the sequence of the oligonucleotide so that 
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the next N bases of the sequence 3' of the previously nucleotide 
are idenitif led by a complementary oligonucleotide bearing a 
label that identifies its sequence. This is shown in Figure 13. 
Suitable labelling systems for use with this invention are 
described herein, as well as in PCT/GB98/00127 of the present 
applicant. Ligation can be chemical or enzymatic. This stage of 
the sequencing process will therefore extend each fragment 
composing the chain terminated population by N bases leaving one 
with a new ladder of terminated fragments. Mass- labelled 
oligonucleotides would preferably be added in two sets of 128 
such that each member in the first set would have its complement 
in the other set. This overcomes the problem of cross- 
hybridisatin between complementary 4-mers. 

The immobilised matrix can then be washed to remove any unbound 
oligonucleotides, a water wash would probably be sufficient to 
disrupt hybridisation. To determine the sequence of the 4 base 
oligonucleotide that ligated to each Sanger fragment, one need 
only analyse the label attached to the 3 ' end of the 
oligonucleotide. The labelling system for use with this invention 
is described in PCT/GB98/00127 in which the mass of the label 
identifies its carrier. Such labels can be made photolabile or 
cleavable by a specific chemical or biological agent. As detailed 
in Figure 14, cleavage of the label will release it into solution 
in which it can be injected into an electrospray mass 
spectrometer for analysis, which will determine the sequence of 
the oligonucleotide and furthermore, its quantity* Sample results 
are shown in Figure 15 . Prior to cleavage of labels one needs 
to separate the Sanger ladder into its component fragment 
lengths. In a mass spectrometry system this stage can be coupled 
to the sample loading in a LCMS system. Separation into bands can 
be achieved by capillary zone electrophoresis. This will then 
pass through a UV spectrometer to determine the quantity of DNA 
in each band. Following this the sample will then pass through 
a photocleavage module to release the mass -labels which will then 
be injected into an electrospray mass spectrometer for analysis 
of the labels in each band. 
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In one embodiment one can probe the immobilised Sanger ladder 
with every one of the possible 256 single -stranded 4 base 
oligonucleotides* Each of these would carry a unique identifying 
label corresponding to its known, sequence of 4 bp. In the 5 ' to 
3' format, the label could be att ached to the 3 ' -OH effectively 
blocking them from further extension, or a separate blocking 
group can be used and the label can be attached elsewhere in the 
molecule . 

One could use chemical ligation rather than enzymatically 
catalysed ligation if that is preferred. This maybe advantageous 
as it would permit probes to be synthesised from DNA analogues 
not accepted by a ligase such as Peptide Nucleic Acid (PNA) or 
other analogues. PNA is desirable as it has higher affinity for 
its complementary DNA sequence than DNA probes. 

Other advantages may be that chemical ligation would allow the 
use of electric fields to regulate the stringency of 
hybridisation of probes electrostatically. 

In a preferred embodiment, a photolysable linker would connect 
the mass label to the 3' -OH which when cleaved would regenerate 
the 3 7 -OH with as high an efficiency as possible. The primer has 
then been extended by 4 known bases and the cycle can be repeated 
to determine the next 4 bp of sequence. This process can be 
repeated iteratively until the entire molecule has been 
sequenced. 

An alternative implementation to using photolysable mass labels 
at the 3 ' -OH of each 4-mer oligonucleotide would be to cap the 
3 '-OH with a phosphate group. One could also use 
dideoxynucleotides or any other appropriate means to block the 
3' -OH of the probe molecules. The mass-label could be attached 
to another part of the molecule from which it can be released 
independently of the uncapping reaction of the 3' terminus. 
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Uncapping of the 3 ' terminus can be effected by washing the 
immobilised DNA with alkaline phosphatase which will readily 
remove the capping phosphate from the 3' -OH leaving it available 
for the next cycle of the sequencing process. 

Conceivably this system could be implemented with other labelling 
schemes, but most other labelling schemes do not generate 
sufficient, unique labels to be practical. 

Sequencing a Povulation of Nucleic Acid Fragments; 

The same process can be applied to a heterogeneous population of 
immobilised nucleic acids allowing them to be analysed in 
parallel. To be successful when applied to a population of 
nucleic acids, this method relies on the assumption that 
statistically 1 out of 256 molecules within the total population 
will carry each of the possible 4 bp sequences adjacent to each 
terminated Sanger fragment. If one sub-sorts ones nucleic acid 
population into manageable subsets of less than 256 fragments, 
one would expect that almost all will have different sequences 
following each terminating base in the Sanger ladder so for most 
purposes one can assume that a hybridisation signal corresponds 
to a single DNA type. This all assumes that DNA sequences are 
random sequences of bases which is not strictly true but is a 
sufficient assumption for the purposes of this invention. 
Obviously 1 in a 1000 is not a small probability and sequences 
will often have the same 4-mer at the same point in a Sanger 
ladder . 

However this invention includes algorithms that can resolve to 
a great extent any possible ambiguities caused by this 
occurrence . 

Reconstructing Sequences of Target Nucleic Acids: 

Analysis of the labels in each band of the modified Sanger ladder 
will generate a matrix of quantities of label corresponding to 
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each possible probe. Shown below is a possible matrix for all 
probes of 4 base pairs in length: 



Sequence to Cycle 1 Cycle 2 Cycle 3 Cycle 4 

which label 

corresponds 



AAAA 5 24 13 7 

AAAC 10 5 9 13 

AAAG 13 9 15 17 



4*4 **** 



TTTG 7 13 17 10 

TTTT 17 10 7 14 



To reconstruct the sequences to which these quantities of label 
correspond, this invention also envisions algorithms for 
analysing such a data matrix. The algorithm attempts to identify 
a sequence on the basis of its frequency, i.e. a sequence present 
at a given frequency will have every subsequence present at the 
same frequency. The algorithm searches through each column of the 
matrix and attempts to resolve label quantities, that may be sums 
of sequence frequencies into atomic quantities such that the same 
set of atomic quantities appear in all columns. The algorithm 
acheives this by comparing label quantities in a given column 
with those in all the other columns. A given atomic quantity that 
appears in all columns is then assumed to correspond to a unique 
sequence . 



If two sequences have the same n-mer at a particular point in the 
sequence, these can be resolved by the quantitative nature of 
this system in that the quantity of a particular n-mer in a 
particular ligation will be the sum of the quantities of the two 
sequences that share the n-mer at the same point . These can be 
largely resolved by comparison of one cycle with previous and 



WO 99/02728 PCT/GB98/02048 

- 49 - 

subsequent ligation cycles to identify such sums. This is made 
particularly simple if the sequences that are being analysed have 
been amplified by PGR such that the sequence in the lowest 
quantity is present at not less than half the quantity of the 
sequence with the greatest frequency, that is to say if the 
frequency range of sequences lies between some quantity N and 2N. 
This means that any sum of frequencies will be greater than 2N 
and hence readily detectable. 

Notice that if the sequence template is a Sanger Ladder, ligation 
and identification of 4-mers to such a ladder will provide 
overlapping 4-mers, thus providing significant redundancy that 
should provide sufficient information to eliminate most 
ambiguities that might arise in individual columns of the 
reconstruction matrix. 

Ligase Chain Reaction: 

A further embodiment of this invention uses a variant of the 
Ligase Chain Reaction. A single stranded template is immobilised 
and then primed at the 3' terminus of the molecule. To this 
immobilised template is added the 256 possible 4-mers in the 
presence of a ligase. The 4-mers are phosphorylated at the 5' 
terminus. For each 4-mer variant a small, predetermined 
proportion of the probe is present with its 3' -OH blocked with 
a photocleavable mass label. The 4-mers will hybridise to the 
template and the ligase will ligate them. However, for any given 
copy of the template there will be a distinct probability that 
an irreversibly blocked 4-mer will be incorporated at a given 
point, thus preventing further extension. 

This will generate a ladder of terminated fragments in a manner 
that is analogous to the Sanger sequencing reaction . The identity 
of the terminating 4-mers can be determined by separating the 
terminated fragments by capillary electrophoresis followed 
directly by analysis of the photocleaved mass labels by 
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electrospray mass spectrometry. 

Here again multiple templates can be analysed simultaneously 
because of the fact that at any point in the sequence a given 
4-mer will occur at a relatively low probability. 

This embodiment permits an additional level of multiplexing. This 
embodiment will give a ladder of fragments where each fragment 
is N bases longer than the previous fragment where N is the 
length of the ligation probes used. Capillary electrophoresis 
will give single base resolution so if the template population 
is divided into 4 subsets, one can prime each subset with a 
primer one base longer than the previous primer. This will give 
4 populations of fragments resolved from each other by a single 
base * 

Ligation of Sanger Ladders to an array of DNA probes and MALDI 
detection of fragments: 

A further embodiment of the invention uses immobilised probes 
rather than mass labelled probes in solution. In this embodiment 
a Sanger Ladder of fragments is generated as described above . The 
immobilised fragments are then released from the solid phase 
substrate and ligated to an array of oligonucleotides bearing all 
4 N probes of length N at discrete locations on the array surface. 
The linkers attaching the probes to the array surface would be 
photocleavable . The array would then be washed and treated with 
an appropriate exonuclease to trim any single- stranded DNA 
remaining- The array would be washed again and a suitable MALDI 
matrix would be applied to the array. The array would be placed 
in MALDI- TO F spectrometer and each location on the array surface, 
corresponding to a specific N-mer probe, would be scanned by a 
laser, first at a frequency to cleave the linker immobilising the 
probes and then at the frequency required to excite the matrix 
used* This will ionise the fragments ligated to the surface at 
the location scanned* These will be detected by Time Of Flight 
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spectrometry . 



For each location on the array one will thus know the sequence 
of the N bases 3' of the termination point in each fragment, the 
mass hence length of the fragments ligated and the quantity of 
each fragment. This will allow the generation of an equivalent 
data matrix to that for the capillary electrophoresis mass 
spectrometry embodiment to be generated. 
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CLAIMS ; 

1. A method for characterising DNA, which comprises: 

(i) providing a population of DNA fragments, each fragment 
having cleavably attached thereto a mass label for identifying 
a feature of that fragment; 

(ii) separating the fragments on the basis of their length; 

(iii) cleaving each fragment in a mass spectrometer to release its 
mass label; and 

(iv) determining each mass label by mass spectroscopy to 
relate the feature of each fragment to the length of the 
fragment . 

2. A method according to claim 1, which further comprises: 

(a) providing at least one DNA single-stranded template 
primed with a primer; and 

(b) generating the population of DNA fragments from the at 
least one template, wherein the population comprises at least one 
series of DNA fragments, the or each series containing all 
possible lengths of a second strand of DNA complementary to the 
or each template ; 

wherein the feature of each fragment determined by each mass 
label relates to a nucleotide or nucleotide sequence at one end 
of each fragment, so that each nucleotide is related to a 
position in the template associated with the mass label so as to 
deduce the sequence of the or each template . 

3 * A method according to claim 2 , wherein the series of DNA 

fragments is provided by contacting the template in the presence 
of DNA polymerase with a mixture of nucleotides sufficient for 
hybridising to the template for forming a second strand of DNA 
complementary to the template, wherein the mixture further 
comprises a set of four probes containing all four nucleotides 
for hybridising to the template in which the nucleotide of each 
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probe comprises a modified nucleotide which is capable of 
polymerising to the second strand of DNA but blocked to prevent 
further polymerisation thereto and which is cleavably attached 
to the mass label, which mass label is uniquely resolvable in 
mass spectrometry for identifying the modified nucleotide, and 
wherein each fragment is terminated with one of the probes . 

4. A method according to claim 2, wherein the at least. one 
template is a plurality of templates and the series of DNA 
fragments is provided by contacting each template in a separate 
reaction zone in the presence of DNA polymerase with a mixture 
of nucleotides sufficient for hybridising to the template for 
forming a second strand of DNA complementary to the template, 
wherein the mixture further comprises a set of four probes 
containing all four nucleotides for hybridising to the template 
in which the nucleotide of each probe comprises a modified 
nucleotide which is capable of polymerising to the second strand 
of DNA but blocked to prevent further polymerisation thereto and 
which is cleavably attached to the mass label, which mass label 
is uniquely resolvable in mass spectrometry for identifying the 
modified nucleotide, wherein each fragment is terminated with one 
of the probes, and wherein each set of mass labels from each set 
of four probes associated with each reaction zone is different 
from the other sets of mass labels; and the fragments are pooled 
before step (ii) . 

5. A method according to claim 2, wherein the at least one 
template is a plurality of templates and the series of DNA 
fragments is provided by contacting each template in a separate 
reaction zone in the presence of DNA polymerase with a mixture 
of nucleotides sufficient for hybridising to the template for 
forming a second strand of DNA complementary to the template, 
wherein the mixture further comprises a probe containing only one 
of the four nucleotides for hybridising to the template, the 
nucleotide of which probe comprises a modified nucleotide which 
is capable of polymerising to the second strand of DNA but 
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blocked to prevent further polymerisation thereto, wherein each 
fragment is terminated with the probe and wherein either the 
primer or the modified nucleotide of the probe is cleavably 
attached to the mass label, which mass label is associated with 
the reaction zone and uniquely resolvable in mass spectrometry 
from the mass label in the other reaction zones for identifying 
the modified nucleotide used in the reaction zone; and the 
fragments are pooled before step (ii) . 

6. A method according to claim 2, wherein the at least one 

template is a plurality of templates and the series of DNA 
fragments is provided by contacting the plurality of templates 
in each of four separate reaction zones in the presence of DNA 
polymerase with a mixture of nucleotides sufficient for 
hybridising to the template for forming a second strand of DNA 
complementary to the template, wherein the mixture further 
comprises a probe containing in each of the reaction zones only 
one of the four* nucleotides for hybridising to the template, the 
nucleotide of which probe comprises a modified nucleotide which 
is capable of polymerising to the second strand of DNA but 
blocked to prevent further polymerisation thereto, wherein each 
fragment is terminated with the probe and wherein the primer is 
cleavably attached to the mass label, which mass label is 
associated with the primer and uniquely resolvable in mass 
spectrometry from the mass labels associated with the other 
primers used in the reaction zone; and wherein each nucleotide 
from its corresponding reaction zone is related to its position 
in the template. 

7* A method according to claim 2, wherein the at least one 

template is four sets of DNA single -stranded templates, each set 
comprising an identical plurality of DNA single -stranded 
templates and the series of DNA fragments is provided by 
contacting each set in a separate reaction zone in the presence 
of DNA polymerase with a mixture of nucleotides sufficient for 
hybridising to the templates for forming a second strand of DNA 



WO 99/02728 



PCT/GB98/02048 



- 55 - 

complementary thereto, wherein the mixture further comprises a 
probe containing in each of the reaction zones only one of the 
four nucleotides for hybridising to the template, the nucleotide 
of which probe comprises a modified nucleotide which is capable 
of polymerising to the second strand of DNA but blocked to 
prevent further polymerisation thereto, wherein each fragment is 
terminated with the probe and wherein each of the templates of 
the four sets is primed with a primer to which the mass label is 
cleavably attached, which mass label which uniquely resolvable 
in mass spectrometry from the mass labels corresponding to the 
other templates and which is relatable to its respective template 
and its respective reaction zone, wherein the fragments are 
pooled before step (ii) , and each nucleotide from its 
corresponding reaction zone is related to its position in the 
template . 

8 . A method according to claim 2 , wherein the at least one 

template is a plurality of templates and the series of DNA 
fragments is provided by contacting each set of templates in a 
separate reaction zone in the presence of DNA polymerase with a 
mixture of nucleotides sufficient for hybridising to the 
templates for forming a second strand of DNA complementary 
thereto, wherein the mixture further comprises a set of four 
probes containing all four nucleotides for hybridising to the 
template in which the nucleotide of each probe comprises a 
modified nucleotide which is capable of polymerising to the 
second strand of DNA but blocked to prevent further 
polymerisation thereto and which is cleavably attached to the 
mass label, which mass label is uniquely resolvable in mass 
spectrometry for identifying the modified nucleotide, wherein 
each fragment is terminated with one of the probes, and wherein 
each set of mass labels from each set of four probes associated 
with each reaction zone is different from the other sets of mass 
labels and, before step (ii) , the fragments are pooled and the 
pooled fragments are sorted according to a sub- sequence having 
a common length of 3 to 5 bases adjacent to the primer to form 
an array of groups of sorted fragments, wherein each group is 
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spatially separated from the other groups. 

9. A method according to claim 2, wherein the series of DNA 
fragments is provided by 

(i) contacting the template in the presence of DNA polymerase 
with a mixture of nucleotides sufficient for hybridising to the 
template for forming a second strand of DNA complementary to the 
template, wherein the mixture further comprises a set of four 
probes containing all four nucleotides for hybridising to the 
templates in which the nucleotide of each probe comprises a 
modified nucleotide which is capable of polymerising to the 
second strand of DNA but reversibly blocked to prevent further 
polymerisation thereto, wherein the step of contacting forms a 
series of templates containing all possible lengths of the second 
strand of DNA, each second strand terminated with one of the 
probes / 

(ii) removing unpolymerised nucleotides ; 

(iii) unblocking the modified nucleotides; and 

(iv) contacting the series of templates with an array of 
oligonucleotide probes, wherein each oligonucleotide probe has 
a nucleotide sequence of common length 2 to 6 , all combinations 
of sequences are present in the array, and wherein each probe is 
cleavably attached to the mass label, which mass label is 
uniquely resolvable in mass spectrometry for identifying the 
nucleotide sequence . 

10. A method according to claim 2, wherein the at least one 
template is a plurality of primed DNA single- stranded templates, 
each at a unique concentration, and the series of DNA fragments 
is provided by 

(i) contacting the templates in the presence of DNA 
polymerase with a mixture of nucleotides sufficient for 
hybridising to the template for forming a second strand of DNA 
complementary to the templates, wherein the mixture further 
comprises a set of four probes containing all four nucleotides 
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for hybridising to the templates in which the nucleotide of each 
probe comprises a modified nucleotide which is capable of 
polymerising to the second strand of DNA but reversibly blocked 
to prevent further polymerisation thereto, wherein the step of 
contacting forms a series of templates containing all possible 
lengths of the second strand of DNA, each second strand 
terminated with one of the probes; 

(ii) removing unpolymerised nucleotides; 

(iii) unblocking the modified nucleotides; and 

(iv) contacting the series of templates with an array of 
oligonucleotide probes, wherein each oligonucleotide probe has 
a nucleotide sequence of common length 2 to 6, all combinations 
of sequences are present in the array, and wherein each probe is 
cleavably attached to the mass label, which mass label is 
uniquely resolvable in mass spectrometry for identifying the 
nucleotide sequence. 

11. A method according to claim 2, wherein the series of DNA 

fragments is provided by contacting the template in the presence 
of DNA ligase with a mixture of oligonucleotides sufficient for 
hybridising to the template for forming a second strand of DNA 
complementary to the template, the oligonucleotides each having 
a common length in the range 2 to 6, wherein the mixture further 
comprises a set of probes containing all possible 
oligonucleotides of the common length 1 for hybridising to the 
templates in which the oligonucleotide of each probe comprises 
a modified oligonucleotide which is capable of ligating to the 
second strand of DNA but blocked to prevent further ligation 
thereto and which is cleavably attached to the mass label, which 
mass label is uniquely resolvable in mass spectrometry for 
identifying the modified oligonucleotide, and the series of 
fragments contains all possible lengths of the second strand of 
DNA of integer multiples of 1, in which each fragment is 
terminated with one of the probes . 
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12. A method according to claim 2, wherein the at least one 
template is a plurality of primed DNA single -stranded templates, 
each at a unique concentration, and the series of DNA fragments 
is provided by contacting the templates in the presence of DNA 
ligase with a mixture of oligonucleotides sufficient for 
hybridising to the templates for forming a second strand of DNA 
complementary to the templates, the oligonucleotides each having 
a common length in the range 2 to 6, wherein the mixture further 
comprises a set of probes containing all possible 
oligonucleotides of the common length 1 for hybridising to the 
templates in which the oligonucleotide of each probe comprises 
a modified oligonucleotide which is capable of ligating to the 
second strand of DNA but blocked to prevent further ligation 
thereto and which is cleavably attached to the mass label, which 
mass label is uniquely resolvable in mass spectrometry for 
identifying the modified oligonucleotide, and the series of 
fragments contains all possible lengths of the second strand of 
DNA of integer multiples of 1, in which each fragment is 
terminated with one of the probes . 

13. A method according to claim 5 or claim 6, wherein the 
plurality of single -stranded templates is primed by hybridising 
to a known sub- sequence common to each of the templates an array 
of primers each comprising a base sequence containing a common 
sequence complementary to the known sub -sequence and a variable 
sequence of common length, in the range 2 to 6, in which the 
array contains all possible sequences of that common length and 
the mass label cleavably attached to each primer is relatable to 
the variable sequence, which variable sequence is relatable to 
the particular template to be sequenced. 

14. A method according to claim 8, wherein the step of 
sorting the pooled fragments comprises contacting the fragments 
with an array of spatially separate oligonucleotides each 
comprising a base sequence containing a common sequence 
complementary to the primer sequence of the fragments and a 
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variable sequence of the common length, which array contains all 
possible variable sequences of the common length. 

15. A method according to any one of claims 4 to 7, wherein 
the reaction zones are separate containers. 

16. A method according to any one of claims 3 to 10 or 13 to 

15, wherein the mixture of nucleotides comprises ATP, TTP, CTP 
and GTP. 

17. A method according to any one of claims 2 to 10 or 13 to 

16, wherein the modified nucleotides are dideoxy- or 
deoxynucleotides . 

18. A method according to any one of claims 2 to 17, wherein 
the primed DNA is immobilised on a solid support. 

19. A method according to any one of the preceding claims, 
wherein the step of separating the fragments on the basis of 
their length is effected by capillary electrophoresis. 

20. A method according to any one of the preceding claims, 
wherein each mass label is cleavably attached to a fragment by 
a linker cleavable in a mass spectrometer. 

21. A method for characterising DNA, which comprises 

(a) providing a primed DNA single- stranded template; 

(b) contacting the template in the presence of DNA polymerase 
with a mixture of nucleotides sufficient for hybridising to the 
template for forming a second strand of DNA complementary to the 
template, wherein the mixture further comprises a set of four 
probes containing all four nucleotides for hybridising to the 
templates in which the nucleotide of each probe comprises a 
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modified nucleotide which is capable of polymerising to the 
second strand of DNA but reversibly blocked to prevent further 
polymerisation thereto, wherein the step of contacting forms a 
series of templates containing all possible lengths of the second 
strand of DNA, each second strand terminated with one of the 
probes ; 

(c) removing unpolymerised nucleotides; 

(d) unblocking the modified nucleotides; 

(e) contacting the series of templates with an array of 
oligonucleotide probes to form a series of fragments, each 
oligonucleotide probe having a nucleotide sequence of common 
length 2 to 6 , and all combinations of sequences being present 
in the array, wherein each probe is cleavably attached to a 
corresponding mass label uniquely resolvable in mass spectrometry 
for identifying the nucleotide sequence; 

(f ) separating the fragments from one another on the basis of 
their length ; 

(g) cleaving each fragment to release its mass label; and 

(h) determining each mass label by mass spectrometry to 
relate its corresponding nucleotide sequence to a position in the 
template so as to deduce the sequence of the template. 

22. A method for characterising DNA, which comprises 

(a) providing a plurality of primed DNA single -stranded 
templates, each at a unique concentration ; 

(b) contacting the templates in the presence of DNA 
polymerase with a mixture of nucleotides sufficient for 
hybridising to the template for forming a second strand of DNA 
complementary to the templates, wherein the mixture further 
comprises a set of four probes containing all four nucleotides 
for hybridising to the templates in which the nucleotide of each 
probe comprises a modified nucleotide which is capable of 
polymerising to the second strand of DNA but reversibly blocked 
to prevent further polymerisation thereto, wherein the step of 
contacting forms a series of templates containing all possible 



WO 99/02728 



PCT7GB98/02048 



- 61 - 

lengths of the second strand of DNA, each second strand 
terminated with one of the probes ; 

(c) removing unpolymerised nucleotides; 

(d) unblocking the modified nucleotides; 

(e) contacting the series of templates with an array of 
oligonucleotide probes to form a series of fragments, each 
oligonucleotide probe having a nucleotide sequence of common 
length 2 to 6, and all combinations of sequences being present 
in the array, wherein each probe is cleavably attached to a 
corresponding mass label uniquely resolvable in mass spectrometry 
for identifying the nucleotide sequence; 

(f ) separating the fragments from one another on the basis of 
their length; 

(g) cleaving each fragment to release its mass label; and 

(h) determining the identity and amount of each mass label by 
mass spectrometry to relate its corresponding nucleotide sequence 
to a position in its respective template so as to deduce the 
sequence of the template, 

23 * A method for characterising DNA, which comprises 

(a) providing a primed DNA single- stranded template; 

(b) contacting the template in the presence of DNA ligase 
with a mixture of oligonucleotides sufficient for hybridising to 
the template for forming a second strand of DNA complementary to 
the template, the oligonucleotides each having a common length 
in the range 2 to 6, wherein the mixture further comprises a set 
of probes containing all possible oligonucleotides of the common 
length 1 for hybridising to the templates in which the 
oligonucleotide of each probe comprises a modified 
oligonucleotide which is capable of ligating to the second strand 
of DNA but blocked to prevent further ligation thereto and which 
is cleavably attached to a corresponding mass label uniquely 
resolvable in mass spectrometry for identifying the modified 
oligonucleotide, wherein the step of contacting forms a series 
of fragments containing all possible lengths of the second strand 
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of DNA of integer multiples of 1, each fragment terminated with 
one of the probes; 

(c) separating the fragments from one another on the basis of 
their length; 

(d) cleaving each fragment to release its mass label; and 

(e) determining each mass label by mass spectrometry to 
relate its corresponding oligonucleotide to a position in the 
template so as to deduce the sequence of the template . 

24 . A method for characterising DNA, which comprises 

(a) providing a plurality of primed DNA single -stranded 
templates, each at a unique concentration; 

(b) contacting the templates in the presence of DNA ligase 
with a mixture of oligonucleotides sufficient for hybridising to 
the templates for forming a second strand of DNA complementary 
to the templates, the oligonucleotides each having a common 
length in the range 2 to 6, wherein the mixture further comprises 
a set of probes containing all possible oligonucleotides of the 
common length 1 for hybridising to the templates in which the 
oligonucleotide of each probe comprises a modified 
oligonucleotide which is capable of ligating to the second strand 
of DNA but blocked to prevent further ligation thereto and which 
is cleavably attached to a corresponding mass label uniquely 
resolvable in mass spectrometry for identifying the modified 
oligonucleotide, wherein the step of contacting forms a series 
of fragments containing all possible lengths of the second strand 
of DNA of integer multiples of 1, each fragment terminated with 
one of the probes; 

(c) separating the fragments from one another on the basis of 
t he i r 1 eng t h ; 

(d) cleaving each fragment to release its mass label; and 

(e) determining the identity and amount of each mass label by 
mass spectrometry to relate its corresponding oligonucleotide to 
a position in its respective template so as to deduce the 
sequence of the template- 
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25* Use of a probe or a set of probes in a method according 

to any one of claims 1 to 22, wherein each probe comprises a 
modified nucleotide or oligonucleotide which is capable of 
polymerising to a second strand of DNA complementary to the 
template but blocked to prevent further polymerisation thereto, 
which modified nucleotide or oligonucleotide is cleavably 
attached to a mass label for identifying the modified nucleotide, 
and wherein each mass label is cleavable from the probe in a mass 
spectrometer, is resolvable by mass spectrometry and is relatable 
to its corresponding modified nucleotide or oligonucleotide. 

26. Use of a set of oligonucleotide primers in a method 

according to any one of claims 1, 2, 5, 6, 7 or 13, each primer 
of which comprises a mass label cleavably attached to an 
oligonucleotide primer base sequence for hybridising to a DNA 
single -stranded template to form a primed template, where in each 
mass label of the set, is cleavable from the primer in a mass 
spectrometer, is uniquely resolvable in relation to every other 
mass label in the set by mass spectrometry and is relatable to 
the oligonucleotide primer base sequence . 
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